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GLYFHOSATE TOLERANT PLANTS 



This is a continuation-in-part of our co-pending 
application having serial number 07/543,236 which was filed on 
June 25, 1990. 

BACKOROt JND OF THE INVRNTTON 

Recent advances in genetic engineering have 
provided the requisite tools to transform plants to contain 
foreign genes. It is now possible to produce plants which have 
unique characteristics of agronomic importance. Certainly, 
one such advantageous trait is more cost effective, 
environmentally compatible weed control via herbicide 
tolerance. Herbicide-tolerant plants may reduce the need for 
tillage to control weeds thereby effectively reducing soil erosion. 

One herbicide which is the subject of much 
investigation in this regard is N-phosphonomethyl-glycine 
commonly referred to as glyphosate. Glyphosate inhibits the 
shikimic acid pathway which leads to the biosynthesis of 
aromatic compounds including amino acids and vitamins. 
Specifically, glyphosate inhibits the conversion of 
phosphoenolpyruvic acid and 3-phosphoshikimic acid to 5- 
enolpyruvyl-3-phosphoshikimic acid by inhibiting the enzyme 5- 
enolpyruvyl-3-phosphoshikimic acid synthase (EPSP synthase 
orEPSPSX 

It has been shown that glyphosate tolerant plants 
can be produced by inserting into the genome of the plant the 
capacity to produce a higher level of EPSP synthase which 
enzyme is preferably glyphosate tolerant (Shah et al«, 1986). 
The introduction into plants of glyphosate degradation gene(s) 
could provide a means of conferring glyphosate tolerance to 
plants and/or to augment the tolerance of transgenic plants 
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already expressing a glyphosate tolerant EPSP synthase 
depending upon the physiological effects of the degradation 
products. 

Glyphosate metabolism (degradation) has been 

5 examined in a wide variety of plants and little degradation has 
been reported in most of those studies. In those instances 
where degradation has been reported, the initial breakdown 
product is usually aminomethylphosphonate (AMPA) 
(Coupland, 1985; Marshall et al., 1987). In these instances, it is 

10 not clear if glyphosate is metabolized by the plant or the 
contaminating microbes on the leaf surface to which glyphosate 
was applied. AMPA has been reported to be much less 
phytotoxic than glyphosate for most plant species (Franz, 1985) 
but not for all plant species (Maier, 1983; Tanaka et al., 1988). 

15 Glyphosate degradation in soils is much more extensive and 
rapid (Torstensson, 1985). The principal breakdown product 
identified is AMPA (Rueppel et al., 1977; Nomura and Hilton, 
1977); a phosphonate that can be metabolized by a wide variety of 
microorganisms (Zeleznick et al., 1963; Mastalerz et al., 1965; 

20 Cook et al., 1978; Daughton et al., 1979a; 1979b; 1979c; Wackett et 
al., 1987a). A number of pure cultures of bacteria have been 
identified that degrade glyphosate by one of the two known 
routes (Moore et al., 1983; Talbot et al., 1984; Sbinabarger and 
Braymer, 1986; Balthazor and Hallas, 1986; Kishore and Jacob, 

25 1987; Wackett et al., 1987a; Pipke et al., 1987a; Pipke et al., 
1987b; Hallas et al., 1988; Jacob et al., 1985 and 1988; Pipke and 
Amrhein, 1988; Quinn et al., 1988 and 1989; Lerbs et al., 1990; 
Schowanek and Verstraete, 1990; Weidhase et al., 1990; Iiu et 
al., 1991). A route involving a "C-P lyase" that degrades 

30 glyphosate to sarcosine and inorganic orthophosphate (Pi) has 
been reported for a Pseudomonas sp. (Sbinabarger and 
Braymer, 1986; Kishore and Jacob, 1987) and an Arthrobacter 
sp. (Pipke et al., 1987b). Pure cultures capable of degrading 



glyphosate to AMPA have been reported for a Flavobacterium 
sp. (Balthazor and Hallas, 1986), for a Pseudomonas sp. (Jacob 
et aL t 1988) and for Arthrobacter atrocyaneus (Pipke and 
Amrhein, 1988). In addition, a large number of isolates that 
convert glyphosate to AMPA have been identified from 
industrial activated sludges that treat glyphosate wastes 
(Hallas et al M 1988). However, the number and nature of 
bacterial genes responsible for these degradations have not been 
heretofore determined nor have the gene(s) been isolated. 

Hence, in one aspect, an object of the present 

invention is to provide novel genes which encode a glyphosate 

metabolizing enzyme which converts glyphosate to 

aminomethylphosphonate and glyoxylate. 

Another object is to enhance the activity of the 

glyphosate metabolizing enzyme against glyphosate by 

replacement of specific amino acid residues. 

Another object of the present invention is to provide 

genetically modified plants which express a gene which 

encodes a glyphosate metabolizing enzyme and which exhibit 

enhanced tolerance to glyphosate herbicide. 

Another object is to demonstrate that a glyphosate 

metabolizing enzyme can be targeted to plastids using 

chloroplast transit peptides and the plastid targeted enzyme 

confers high level glyphosate tolerance. 

A further object is to provide a method for selecting 

transformed plant tissue using the glyphosate metabolizing 

enzyme as the selectable marker in the presence of inhibitory 

concentrations of glyphosate. 

These and other objects, aspects and features of the 

present invention will become evident to those skilled in the art 

from the following description and working examples. 



SUMMARY OF THTC TMW.TsJ77n N 



The present invention provides structural DNA 
constructs which encode a glyphosate oxido-reductase enzyme 
and which are useful in producing glyphosate degradation 
capability in heterologous microorganisms (e.g. bacteria and 
plants) and in producing glyphosate tolerant plants. 

In accomplishing the foregoing, there is provided, 
in accordance with one aspect of the present invention, a 
method of producing genetically transformed plants which are 
tolerant toward glyphosate herbicide, comprising the steps of: 

(a) inserting into the genome of a . plant cell a 
recombinant, double-stranded DNA molecule 
comprising 

(i) a promoter which functions in plant cells to 
cause the production of an RNA sequence, 

(ii) a structural DNA sequence that causes the 
production of an RNA sequence which 
encodes a glyphosate ozidoreductase enzyme, 

(iii) a 3* non-translated DNA sequence which 
functions in plant cells to cause the addition 
of polyadenylated nucleotides to the 3' end of 
the RNA sequence; 

where the promoter is heterologous with respect to 
the coding sequence and adapted to cause sufficient 
expression of said enzyme in plant tissue, including 
meristematic tissue, to enhance the glyphosate 
resistance of a plant cell transformed with said gene; 

(b) obtaining a transformed plant cell; and 

(c) regenerating from the transformed plant cell a 
genetically transformed plant which has increased 
tolerance to glyphosate herbicide. 



In accordance with another aspect of the present 
invention, there is provided a recombinant, double-stranded 
DNA molecule comprising in sequence: 

(a) a promoter which functions in plants to cause the 
production of an RNA sequence; 

(b) a structural DNA sequence that causes the 
production of an RNA sequence which encodes a 
glyphosate oxidoreductase enzyme; and 

(c) a 3' non-translated region which functions in plants 
to cause the addition of polyadenylated nucleotides to 
the 3' end of the RNA sequence. 

There has also been provided, in accordance with another 
aspect of the present invention, bacterial and transformed plant 
cells that contain, respectively, DNA comprised of the above- 
mentioned elements (a), (b) and (c). 

In accordance with yet another aspect of the present 
invention, differentiated plants have been provided that 
comprise transformed plant cells, as described above, which 
exhibit tolerance toward glyphosate herbicide. 

In accordance with still another aspect of the present 
invention, there has been provided a method for selectively 
controlling weeds in a field containing a crop having planted 
crop seeds or plants comprising the steps of: 

(a) planting said crop seeds or plants which are 
glyphosate tolerant as a result of a recombinant 
double-stranded DNA molecule being inserted into 
said crop seed or plant, said DNA molecule having 

(i) a promoter sequence which functions in 
plants to cause the production of an RNA 
sequence, 

(ii) a structural DNA sequence which causes the 
production of RNA which encodes a 
glyphosate oxidoreductase enzyme, 



(iii) a 3* non-translated region which encodes a 
polyadenylation signal which functions in 
plants to cause the addition of polyadenylated 
nucleotides to the 3* end of the RNA sequence, 

where the promoter is heterologous with respect to 
the coding sequence and adapted to cause sufficient 
expression of said enzyme in plant tissue, including 
meristematic tissue, to enhance the glyphosate 
tolerance of a plant cell transformed with said gene; 
and 

(b) applying to said crop and weeds in said field a 
sufficient amount of glyphosate herbicide to control 
said weeds without significantly affecting said crop. 
In a particularly preferred embodiment the double- 
stranded DNA molecule comprising a gene for plant expression 
comprises a structural DNA sequence encoding a fusion 
polypeptide containing an amino- terminal chloroplast transit 
peptide which is capable of causing importation of the carboxy- 
terminal glyphosate oxidoreductase enzyme into the chloroplast 
of the plant cell expressing said gene. 

A further embodiment of the present invention is the 
use of the glyphosate oxidoreductase gene as a selectable 
marker to select and identify transformed plant tissue. 

BRIEF DESCRTPTTON OF TFTR TVRAWT^fffi 

Figure 1 shows the DNA sequence for the full- 
length promoter of figwort mosaic virus (FMV). 

Figure 2 shows the structural DNA sequence for a 
glyphosate oxidoreductase gene from bacterial isolate LBAA. 

Figure 3 shows a comparison of the manipulated 
structural glyphosate oxidoreductase gene versus a modified 



glyphosate oxidoreductase gene adapted for enhanced 
expression in plants. The manipulated glyphosate 
oxidoreductase gene is displayed as the upper DNA sequence. 
Only the changes made in the modified gene are indicated in 
the lower strand of sequences. 

Figure 4 shows a comparison of the manipulated 
structural glyphosate oxidoreductase gene versus a synthetic 
glyphosate oxidoreductase gene adapted for enhanced 
expression in plants. The manipulated glyphosate 
oxidoreductase gene is displayed as the upper DNA sequence. 

Figure 5 shows the structure of pMON17032, a 
pMON886 vector containing the modified glyphosate 
oxidoreductase gene inserted as an En-CaMV35S-modified 
glyphosate oxidoreductase-NOS 3 f cassette into the NotI site of 
the vector. The pMON886 vector is described in the text. 

Figure 6 shows the nucleotide sequence of the 
CTP1 chloroplast transit peptide derived from the A. thaliana 
SSU1A gene. 

Figure 7 shows the genetic/structural map of 
plasmid pMON17066, a pMON979-type vector containing a gene 
encoding a CTP/synthetic glyphosate oxidoreductase fusion 
polypeptide. Related pMON979-type derivatives are pMON17065 
andpMON17073. 

Figure 8 shows the genetic/structural map of 
plasmid pMON17138, an example of a pMON981-type vector 
containing a gene encoding a CTP/synthetic glyphosate 
oxidoreductase fusion polypeptide. In this example the CTP1- 
synthetic glyphosate oxidoreductase gene has been cloned into 
pMON979 as aXbahBamHI fragment 

Figure 9 shows the nucleotide sequence of the 
CTP2 chloroplast transit peptide derived from the A. thaliana 
EPSPS gene. 
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Figure 10 shows the structural map of plasmid 
pMON17159. 

Figure 11 shows the structural map of plasmid 

pMON 17226. 

5 Figure 12 shows the structural map of plasmid 

pMON17164. 

statement op TffF, TNYFiNTT°N 

10 The expression of a plant gene which exists in 

double-stranded DNA form involves synthesis of messenger 
RNA (mRNA) from one strand of the DNA by RNA polymerase 
enzyme, and the subsequent processing of the mRNA primary 
transcript inside the nucleus. This processing involves a 

15 3' signal region which facilitates addition of polyadenylate 
nucleotides to the 3' end of the RNA 

Transcription of DNA into mRNA is regulated by a 
region of DNA usually referred to as the "promoter." The 
promoter region contains a sequence of bases that signals RNA 

20 polymerase to associate with the DNA, and to initiate the 
transcription into mRNA using one of the DNA strands as a 
template to make a corresponding complementary strand of 
RNA 

A number of promoters which are active in plant 
25 cells have been described in the literature. These include the 
nopaline synthase (NOS) and octopine synthase (OCS) 
promoters (which are carried on tumor-inducing plasmids of 
Agrobacterium tumefaciens), the caulimovirus promoters such 
as the cauliflower mosaic virus (CaMV) 19S and 35S promoters 
30 and the figwort mosaic virus (FMV) 35S promoter, the light- 
inducible promoter from the small subunit of ribulose bis- 
phosphate carboxylase (ssRUBISCO, a very abundant plant 
polypeptide). All of these promoters have been used to create 
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various types of DNA constructs which have been expressed in 
plants; see, e.g., PCT publication WO 84/02913 (Rogers et al., 
Monsanto). 

Promoters which are known or are found to cause 

5 transcription of DNA in plant cells can be used in the present 
invention. Such promoters may be obtained from a variety of 
sources such as plants and plant DNA viruses and include, but 
are not limited to, the CaMV35S and FMV35S promoters and 
promoters isolated from plant genes such as ssRUBISCO genes 

10 or the chlorophyll a/b binding proteins. As described below, it is 
preferred that the particular promoter selected should be 
capable of causing sufficient expression to result in the 
production of an effective amount of glyphosate oxidoreductase 
to render the plant substantially tolerant to glyphosate 

15 herbicides. The amount of glyphosate oxidoreductase needed to 
induce the desired tolerance may vary with the plant species. 

It is preferred that the promoters utilized have 
relatively high expression in all meristematic tissues in 
addition to other tissues inasmuch as it is now known that 

20 glyphosate is translocated and accumulated in this type of plant 
tissue. Alternatively, a combination of chimeric genes can be 
used to cumulatively result in the necessary overall expression 
level of glyphosate oxidoreductase enzyme to result in the 
glyphosate tolerant phenotype. 

25 The mRNA produced by a DNA construct of the 

present invention also contains a 5' non-translated leader 

sequence. This sequence can be derived from the promoter 

* 

selected to express the gene, and can be specifically modified so 
as to increase translation of the mRNA. The 5' non-translated 
30 regions can also be obtained from viral RNAs, from suitable 
eukaryotic genes, or from a synthetic gene sequence. The 
present invention is not limited to constructs, as presented in 
the following examples*, wherein the non-translated region is 
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derived from both the 5* non-translated sequence that 
accompanies the promoter sequence and part of the 5' non- 
translated region of the virus coat protein gene. Rather, the 
non-translated leader sequence can be derived from an 
unrelated promoter or coding sequence as discussed above. 

A preferred promoter for use in the present 
invention is the full-length transcript (35S) promoter from the 
figwort mosaic virus (FMV) which functions as a strong and 
uniform promoter for chimeric genes inserted into plants, 
particularly dicotyledons. In general, the resulting transgenic 
plants express the protein encoded by the inserted gene at a 
higher and more uniform level throughout the tissues and cells 
than the same gene driven by an enhanced CaMV35S promoter. 
Referring to Figure 1, the DNA sequence of the promoter is 
located between nucleotides 6368 and 6930 (SEQ ID NO:l) of the 
FMV genome. A 5" non- translated leader sequence is 
preferably coupled with the promoter and an exemplary leader 
sequence (SEQ ID NO:2) is shown in Figure 1. The leader 
sequence can be from the FMV genome itself or can be from a 
source other than FMV. 

The 3' non-translated region of the chimeric plant 
gene contains a polyadenylation signal which functions in 
plants to cause the addition of polyadenylate nucleotides to the 
3' end of the RNA. Examples of suitable 3' regions are (1) the 
3' transcribed, non-translated regions containing the 
polyadenylation signal of Agrobacterium tumor-inducing (Ti) 
plasmid genes, such as the nopaline synthase (NOS) gene, and 
(2) plant genes such as the soybean storage protein genes and 
the small subunit of the ribulose-l,5-bisphosphate carboxylase 
(ssRUBISCO) gene. An example of a preferred 3' region is that 
from the ssRUBISCO gene from pea (E9), described in greater 
detail in the examples below. 
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The DNA constructs of the present invention also 
contain a structural coding sequence in double-stranded DNA 
form, which encodes a glyphosate oxidoreductase enzyme 
which converts glyphosate to aminomethylphosphonate and 
glyoxylate. 

Summary of the Glyphosate Oxidoreductase Reaction 

The enzyme glyphosate oxidoreductase catalyzes 
the cleavage of the C-N bond of glyphosate yielding 
aminomethyl phosphonate (AMPA) and glyoxylate as the 
reaction products. Under aerobic conditions, oxygen is utilized 
as a cosubstrate for the reaction. Other electron carriers such 
as phenazine methosulfate and ubiquinone stimulate the 
reaction under aerobic conditions. In the absence of oxygen, 
these compounds act as electron acceptors. 

The enzymatic reaction can be assayed by oxygen 
uptake using an oxygen electrode. The glyphosate oxido- 
reductase from LBAA does not produce hydrogen peroxide as a 
product of oxygen reduction. This enzyme has a stoichiometry 
of two moles of glyphosate oxidized per mole of oxygen 
consumed and produces two moles each of AMPA and 
glyoxylate as reaction products. 

An alternate method for the assay of glyphosate 
oxidoreductase involves reaction of the sample with 2,4- 
dinitrophenylhydrazine and determination of the amount of the 
glyoxylate-2,4-dinitrophenylhydrazone by HPLC analysis as 
described in detail in a later section. 

A third method for the assay of glyphosate 
oxidoreductase consists of using [3-"C]-glyphosate as a 
substrate; the radioactive AMPA produced by the enzyme is 
separated from the substrate by HPLC on anion exchange 
column as described later. The radioactivity associated witn 
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AMPA is a measure of the extent of the glyphosate 
oxidoreductase reaction. 

Glyphosate oxidoreductase from LBAA is a 
flavoprotein using FAD as a cofactor. One of the mechanisms 
we have proposed for the reaction catalyzed by this enzyme 
involves the reduction of the FAD at the active site of the enzyme 
by glyphosate. This leads to the formation of reduced FAD and 
a Schiff base of aminomethylphosphonate with glyoxylate. The 
Schiff base is hydrated by water and hydrolyzed to its 
components, AMPA and glyoxylate. The reduced flavin is 
reoxidized by molecular oxygen. We suggest that during the 
process of reoxidation of reduced FAD, an oxygenated flavin is 
produced as an intermediate. This flavin intermediate may 
catalyze the oxygenation of glyphosate yielding AMPA and 
glyoxylate. This hypothesis is in accordance with the observed 
stoicbiometry and our inability to detect hydrogen peroxide in 
the reaction mixture. 

In addition to glyphosate, glyphosate oxido- 
reductase from LBAA oxidizes iminodiacetic acid (IDA) to 
glycine and glyoxylate. The rate of the reaction with IDA is 
significantly faster than with glyphosate. 

Isolation of Efficient fllvnhoRate-tn-AMPA Detmirfinp 

Bacterium 

Bacteria capable of degrading glyphosate are 
known. (Hallas et al., 1988; Malik et al., 1989). A number of 
these bacteria were screened for the rapid degradation of 
glyphosate in the following manner; twenty three bacterial 
isolates were transferred from TSA (Trypticase Soya Agar; 
BBL) plates into medium A consisting of Dworkin-Foster salts 
medium containing glucose, gluconate and citrate (each at 
0.1%) as carbon source and containing glyphosate at 0.1 mM as 
the phosphorous source? 
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Dworkin-Foster minimal medium was made up by 
combining in 1 liter (with autoclaved H2O) 1 ml each of A, B and 
C and 10 ml of D, thiamine HC1 (5 mg), C-sources to final 
concentrations of 0.1% each and P-source (glyphosate or other 
phosphonates or Pi) to the required concentration: 



A. D-F Salts (1000X stock; per 100 ml; autoclaved): 

H3BO3 1 mg 

MnS04.7H20 1 mg 
ZnS04.7H20 12.5 mg 
CuS04.5H20 8 mg 
NaMo03.3H20 1.7 mg 

B. FeSO4.7H20 (1000X stock; per 100 ml; autoclaved) 

0.1 g 

C. MgSO4.7H20 (1000X stock; per 100 ml; autoclaved) 

20g 

D. (NH4)2S04 (100X stock; per 100 ml; autoclaved) 

20g 



Yeast Extract (YE; Difco) was added to a final concentration of 
0.01 or 0.001%. 

Each 1 ml of culture medium also contained 
approximately 200,000 cpm [3-i4C]glyphosate (Amersham; 
CFA.745). The cultures were incubated with shaking at 30°C. 
Isolate LBAA showed significant growth at day one, while 
other test cultures showed little growth before day three. 
Determination of radioactivity (by scintillation counting) in the 
culture, cell pellet and culture supernatant (at day 4) revealed 
that total 14C radioactivity had decreased and that remaining 
was partitioned -1:1 in the supernatant and pellet, indicating 
that significant uptake and metabolism of glyphosate had taken 
place. 
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TABLE I - Glyphosate Metabolism hv t.raa friilturf 



Sample 
control 

LBAA culture 
LBAA supernatant 
LBAA cells 



18,631 
11,327 
6,007 
4332 



1 4C mm 



At day five, 75 ul of the culture supernatant of all test 
cultures was analyzed by HPLC as follows: a SYNCHROPAK™ 
AX100 anion exchange column (P.J. Cobert) was used and the 
mobile phase consisted of 65 mM KH 2 P0 4 (pH5.5 with NaOH; 
depending on the needs of the experiment the concentration of 
the phosphate buffer was varied from 50 to 75 mM in order to 
alter the retention times of the material), run isocratically and 
the eluted material monitored continuously using a radioactive 
detector. This analysis revealed, in one isolate in particular 
(LBAA), that the glyphosate peak (Retention Time [RT] = 7.0 
minutes in this analysis) was completely absent and a new 
peak of radioactivity had appeared, with the sam e RT as 
methylamine or N-acetylmethylamine (RT = 3.5 minutes). The 
collection of bacteria, of which strain LBAA formed a part, had 
been characterized as degrading glyphosate to AMPA (Hallas et 
al., 1988); the detection of methylamine or N-Acetylmethyl- 
amine suggested that the AMPA or N-AcetylAMPA was being 
metabolized by the LBAA M C-P lyase" activity to release the 
phosphate required for growth in this experiment. Strain 
LBAA was examined in greater detail. 

Conversion of Glyphosate to AMPA in Micr obial Isolate* 

For clarity and brevity of disclosure, the following 
description of the isolation of genes encoding glyphosate 
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oxidoreductase enzymes is directed to the isolation of such a 
gene from a bacterial isolate (LBAA). Those skilled in the art 
will recognize that the same or a similar strategy can be 
utilized to isolate such genes from other microbial isolates. 

The glyphosate degradation pathway was 
characterized in resting cells of glyphosate-grown strain LBAA 
as follows: the cells from a 100 ml culture of LBAA, grown in 
DF medium with glucose, gluconate and citrate as carbon 
sources and with thiamine and Yeast Extract (0.01%) to supply 
trace requirements (= medium DF3S) and with glyphosate at 
0.2 mM as a phosphorous source, were harvested at Klett = 200, 
washed twice with 20 ml of DF3S medium and the equivalent of 
20 ml cells resuspended in 100 ul of the same medium 
containing [3-uC]glyphosate (2.5 ul of 52 mCi/mmol). The cell 
mix was incubated at 30°C with shaking and samples (20 ul) 
were withdrawn at intervals. The samples were centrifuged 
and both the supernatant and cell pellets were analyzed by 
HPLC (the cell pellets were resuspended in 100 ul of acid-DF3S 
[= DF3S, 0.65N HC1], boiled for 5 minutes, centrifuged briefly 
and this supernatant was analyzed; an acidified glyphosate 
control was also examined). In two hours the amount of 
radioactivity in the glyphosate peak (RT = 7.8 minutes) in the 
supernatant had decreased to -33% of the starting level; about 
3% of the glyphosate was found within the cell. Material 
co-eluting with the methylamine standard accounted for -5% of 
the starting counts in the supernatant and for -1.5% in the cell 
pellet. A new peak, accounting for -1.5$ of the starting 
radioactivity with a RT of 7.7 minutes (glyphosate RT = 8.9 
minutes upon acidification in this experiment) was identified 
in the cell contents. The large decrease in overall radioactivity 
also suggested that the glyphosate was extensively metabolized 
in this experiment. The pathway was elucidated further in a 
subsequent experiment where the metabolism of [^CjAMPA 
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was compared to that of [3-i4C]glyphosate (as above) in resting 
cells harvested at Hett 165 and resuspended at the equivalent to 
15 ml cells per 100 ul DF3S medium. The samples were 
analyzed by HPLC and consisted of whole cultures acidified and 
treated as described above. Within the first two hours of the 
glyphosate experiment, 25% of the radioactivity was found in 
the methylamine/N-acetylmethalamine peak (RT = 4.8 
minutes), 12.5% as AMPA (RT = 6.4 minutes), 30% as the peak 
alluded to above (RT = 9.4 minutes) and 30% as glyphosate (RT = 
11.8 minutes). In the AMPA experiment 15% of the 
radioactivity was found as N-acetylmethylamine/methyl- 
amine, 59% as AMPA and 18% in the peak with RT = 9.4 
minutes. The modified form of AMPA was identified as 
N-acetylAMPA. A similar acetylation step has been inferred 
from the products identified in E. coli growing in 
aminomethylphosphonates as sole sources of P (Avila et al., 
1987). These data indicated that the glyphosate degradation 
pathway in LBAA is glyphosate AMPA methylamine) 
N-acetylAMPA N-acetylmethylamine. 

Cloning Of the Glyphosate Oxidoreduftfaiae Gene(*\ in F. nnli 

Having established the glyphosate-to-AMPA 
conversion in strain LBAA, a direct approach for the cloning of 
the gene(s) involved in this conversion into E. coli was 
investigated. Cloning and genetic techniques, unless otherwise 
indicated, were generally those described (Maniatis et al., 1982). 
Th6 cloning strategy was as follows: introduction of a cosmid 
bank of strain LBAA into E. coli and selection for the 
glyphosate-to-AMPA gene(s) by requiring growth on glyphosate 
as a phosphorous (P) source. This selection relied on the use of 
AMPA generated by the glyphosate metabolizing enzyme as a P 
source, following the release of the Pi from the AMPA by the 
E. coli "C-P lyase." Most E. coli strains are incapable of 
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utilizing phosphonates as P sources upon initial challenge, 
however these strains usually adapt rapidly, independently of 
RecA, to utilize phosphonates (become Mpu+) (Wackett et al., 
1987b). E. coli Mpu* was isolated from E. coli SR200 (Leu-, Pro-, 

5 recA, hsdR, supE, Smr, tonA,) as follows: an aliquot of a fresh 
L-broth culture of E. coli SR200 was plated on MOPS (Neidhardt 
et al., 1974) complete agar (i.e., contains L-leucine and L- 
proline at 25 ug/ml and vitamin Bl [thiamine] at 10 ug/ml; agar 
= DIFCO "Purified") containing aminomethylphosphonate 

10 (AMPA; 0.2 mM; Sigma) as P source. 

MOPS medium is: 
10 ml 10X MOPS SALTS 
2 ml 0.5 mg/ml Thiamine HC1 
15 1 ml 20% glucose 



10 X MOPS Salts are: 
for 100 ml 



25 



40 ml 


lMMOPSpH7.4 


4ml 


1M Tricine pH7.4 


1ml 


0.01MFeSO 4 .7H2O 


5ml 


I.9MNH4CI 


1ml 


0.276 MK2SO4 


1ml 


0.5 mM CaCl 2 


1ml 


0.528 M MgCl 2 


10 ml 


5MNaCl 


1ml 


0.5% L-Methionine 



lml Micronutrients 



30 
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Micronutrients are: 

3x10-9M(NH4)6MhtO24 

4xl0-7MH3BO 4 

3xlO-«MCoCl2 

1.6 x 10-8 MCuS0 4 

8xl0-*MMnCl 2 

lxlO-«MZnS0 4 

Six individual colonies were picked from this plate after three 
days incubation at 37°C and streaked on MOPS complete agar 
containing either AMPA or methylphosphonate (Alfa) as P 
source. One colony, designated E. coli SR200 Mpu+, was chosen 
from those that grew equally and uniformly on both 
phosphonate media. 

Chromosomal DNA was prepared from strain LBAA 
as follows: The cell pellet from a 100 ml L-Broth (Miller, 1972) 
late log phase culture of LBAA was resuspended in 10 ml of 
Solution I (Birnboim and Doly, 1979). SDS was added to a final 
concentration of 1% and the suspension was subjected to three 
freeze-thaw cycles, each consisting of immersion in dry ice for 
15 minutes and in water at 70°C for 10 minutes. The lysate was 
then extracted four times with equal volumes of 
phenokchloroform (1:1; phenol saturated with TE) (TE = lOmM 
Trie pH8.0; l.OmM EDTA) and the phases separated by 
centrifugation (15000g; 10 minutes). The ethanol-precipitable 
material was pelleted from the supernatant by brief 
centrifugation (8000g; 5 minutes) following addition of two 
volumes of ethanol. The pellet was resuspended in 5 ml TE 
and dialyzed for 16 hours at 4°C against 2 liters TE. This 
preparation yielded a 6 ml DNA solution of 150 ng/ml. 

Partially-restricted DNA was prepared as follows: 
Three 100 ug aliquot samples of LBAA DNA were treated for 1 
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hour at 37°C with restriction endonuclease HindUl at rates of 
4, 2 and 1 enzyme unit/ug DNA, respectively. The DNA 
samples were pooled, made 0.25 mM with EDTA and extracted 
with equal volume of phenohchloroform. Following the 

5 addition of NaAcetate and ethanol, the DNA was precipitated 
with two volumes of ethanol and pelleted by centrifugation 
(12000 g; 10 minutes). The dried DNA pellet was resuspended 
in 500 ul TE and layered on a 10-40% Sucrose gradient (in 5% 
increments of 5.5 ml each) in 0.5 M NaCl, 50 mM Tris pH8.0, 5 

10 mM EDTA. Following centrifugation for 20 hours at 26,000 rpm 
in a SW28 rotor, the tubes were punctured and 1 ml fractions 
collected. Fifteen ul samples of each third fraction were run on 
0.8 % agarose gel and the size of the DNA determined by 
comparison with linearized lambda DNA and Hindlll-digested 

15 lambda DNA standards. Fractions containing DNA of 25-35 kb 
fragments were pooled, desalted on AMICON10 columns (7000 
rpm; 20°C; 45 minutes) and concentrated by precipitation. This 
procedure yielded 50 ug of LBAA DNA of the required size. 

Plasmid pHC79 (Hohn and Collins, 1980) DNA and a 

20 Hin dm-phosphatase treated vector was prepared as described 
elsewhere (Maniatis et al., 1982). The ligation conditions were 
as follows: 



25 



30 
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Vector DNA {HindKl- and calf 
alkaline phosphatase-treated) 



1.6 ug 



Size fractionated LBAA 
iftndlll fragments 



3.75 jig 



10X ligation buffer 



2.2 ul 



10 



15 



20 



25 



250 mM Tris-HCl, pH 8.0; 
100 mM MgCl 2 ; 

100 mM Ditbiothreitol; 
2 mM Spermidine 

T4DNAligase 
(Boehringer-Mannheim) 

(400 units/ul) i.o ul 

H2Oto22.0ul 
18 hours at 16°C. 

The ligated DNA (4 ul) was packaged into lambda phage 
particles (Stratagene; Gigapack Gold) using the 
manufacturer's procedure. 

E. coli SR200 Mpu+, grown overnight in L-Broth (with 
maltose at 0.2%), was infected with 50 ul of the packaged DNA. 
Transformant8 were selected on MOPS complete agar plus 
ampicillin and with glyphosate at 0.2 mM as P source. 

Aliquot samples were also plated on MOPS 
(Neidhardt et al., 1974) complete agar plus ampicillin 
containing Pi at ImM to titer the packaged cosmids. Cosmid 
transformants were isolated on this latter medium at a rate of 
-105 per ug/LBAA HindHI DNA after 2 days at 37°G. Colonies 
arose on the glyphosate-agar from day 3 until day 10 with a final 
rate of 1 per 200-300 cosmids. Plasmid DNA was prepared from 
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twenty one cosmid transfonnants from the glyphosate plates. 
These cosmids fell into at least two classes based on the HindUl 
restriction pattern of the plasmid DNA. In Class I f all the 
cosmids had cloned 6.4 and 4.2 kb Hindlll restriction 

5 fragments in common and in Class II, a -23 kbp fragment in 
common. Ten cosmids, representative of the diversity of the 
cloned fragments, were re-transformed into E. coli SR200 Mpu+ 
and the glyphosate utilization trait verified by selection for 
growth on MOPS complete agar plus ampicillin plus glyphosate 

10 plates. The final cell density achieved by the cultures using 
glyphosate (0.2mM in MOPS medium) as a P source was also 
determined and little difference could be discerned between the 
different transfonnants. Transfonnants were also inoculated 
into MOPS complete broth with AMPA at 0.1 mM as P source 

15 (to ensure the presence of M C-P lyase" activity) and after 24 
hours at 37°C were diluted 100-fold into MOPS complete 
medium with glyphosate at 0.1 mM and [3-nC]glyphosate 
(40,000 cpm/ml). All the cosmid-containing cells degraded 
glyphosate and generated N-acetylAMPA and N-acetylmethyl- 

20 amine, with no great difference in the rate. The 
N-acetylAMPA was found in the culture supernatant in these 
tests. One cosmid from Class I, identified as pMON7468, was 
chosen for further study* A second glyphosate oxidoreductase 
gene has been identified from a Class II cosmid clone. 

25 Cell-free lysates E. coli SB200 Mpu+/pMON7468 were 

prepared from cells grown on MOPS complete medium with 
glyphosate at 1.0 mM (and supplemented with 
L-phenylalanine, L- tyrosine and L-tryptophan each at 100 
jig/ml and with para-hydroxybenzoic acid, 2,3-dihydroxy- 

30 benzoic acid and para-aminobenzoic acid each at 5 jig/ml to 
minimize the effects of inhibition of the E. coli EPSP synthase). 
The cell pellet (approx. 0.5 g wet weight) was resuspended in 1 
ml of lysis buffer (40 mM MOPS, pH7.4; 4 mM Tricine, pH 7.4; 
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10% glycerol; 1 mM DTT) and passed twice through a French 
Press. The cell debris was removed by centrifugation at 15000 
rpm for 10 minutes. The supernatant was assayed, following 
addition of MgCl 2 to 10 mM, for degradation of radiolabeled 
glyphosate. The glyphosate substrate was supplied as 
[3-i4C]glyphosate (final concentration = 17 uM). The products 
observed were predominantly AMPA and some 
N-acetylAMPA; the production of AMPA is indicative of the 
cloned enzymatic activity from strain LBAA but the 
N-acetylAMPA could be due to endogenous E. coli activities 
(Avila et al., 1987). The specific activity for AMPA formation 
under these conditions was 13.3 pmoles AMPA/minute.mg 
protein. 

Characterization of the Olvphoaatft-t.n.AMPA ft» nft 

The cloned region responsible for this glyphosate 
oxidoreductase enzymatic activity was then localized in the 
cosmid. Deletions of pMON7468 were isolated, primarily within 
the cloned region, by using restriction enzymes that cut 
infrequently within the insert, as follows: plasmid DNA 
samples of 0.5 - 2 ug were digested to completion with 
restriction endonucleases ivorl, Sac I, Bglll or BamHI, 
extracted with phenol:chloroform, ethanol precipitated, 
resuspended in TE buffer and ligated for 2-4 hours at room 
temperature (or for 18 hours at 16°C) in a final volume of 50 ul 
with ligation buffer and T4 DNA ligase. Tranaformants were 
selected in E. coli SR200 Mpu+ and these deletions were 
examined for loss or retention of the glyphosate utilization 
phenotype. These data, in conjunction with restriction 
mapping of the clones, were used to localize the active region to 
near the central portion of the insert in pMON7468 that 
included the two common fTindlll fragments (6.4 and 4.2 kb). 
The HindJH restriction fragments from this region were then 
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subcloned into pBlueScript (Stratagene) and their glyphosate 
phenotype determined in E. coli JM101 Mpu+ (the Mpu+ 
derivative of JM101 was isolated as described for SR200 Mpu+). 
Clones containing the 6.4 kb Hindlll fragment, in either 
orientation, resulted in glyphosate utilization. Following 
restriction mapping of this HindUl fragment, a series of 
deletion clones were isolated from the two 6.4 kb HindJll clones 
using enzymes that cut infrequently in the insert and also in 
the polylinker region. A number of restriction fragments 
internal to the Hindlll fragment were also subcloned. The 3.5 
kb Pstl and 2.5 kb BgVfl fragments, in either orientation, were 
positive for glyphosate utilization. These data, combined with 
those from the deletions, were used to localize the active region 
to an approximately 1.8 kb BglU-XhoI fragment. In addition, 
deletions isolated from the 6.4 kb HindUl fragment indicated a 
minimum coding region size of around 0.7 kb, yrith the EcoRI 
and Sad sites probably located within the coding sequences* 

The direction of transcription/expression of the locus 
responsible for the glyphosate- to-AMP A enzymatic activity was 
determined as follows: E. coli JM101 Mpu+ transformants of 
pMON7469 #1 and #4 (Clones of the 2.5 kb BglU fragment in 
the BamHI site of pUCH8; opposite orientations) were grown in 
M9-glucose-thiamine- ampicillin broth, with and without the 
Plac inducer IPTG, harvested in late log phase (Klett 190-220), 
cell-free lysates of the four cultures were prepared as described 
above and were assayed for glyphosate-to-AMPA activity with 
glyphosate at 17 jiM. The highest enzymatic activity was 
obtained for pMON7469 #1 plus IPTG, where the Xhol site is 
distal to the Plac, suggesting that the gene(s) were expressed in 
the Bglll-to-XhoI direction. 
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TABLE n - GlvohosatP tQ A MPA ArtiviH, j n 
Cell-Frfift T,VSafrS Of?? mli Transform*^ 

5 IPTG Specific Activity 

Clans addsd nmoies ampa ^ n mr 

pMON7469#l no <3.o 

pMON7469#l yes 32.O 

10 pMON7469#4 no <3.o 

pMON7469#4 yes <3.o 



The only product observed was AMPA, suggesting that the 
AMPA acetylating activity that was described earlier had been 
15 induced in E. coli transform ants growing on glyphosate as the 
P source. 

In a later experiment, cell lysates of pMON7469#l 
and pMON7470 (Bglll-Xhol 1.8 kb in pUC118; formed from 
PMON7469 #1 by deletion of the - 700 b^Xhol-Sall fragment) 
20 were assayed for glyphosate-to-AMPA activity with glyphosate 
at 2 mM (Sp. Act. [3-"C]glypho8ate ■ 3.7 mCi/mmol; 0.2 
uCi/reaction; cultures grown with IPTG in medium) and much 
higher enzymatic activities were recorded, reflecting the 
improved assay conditions. 

25 

TABLE TTT - Glyphosate to AMPA Activity in 

Cell-Free Lvsafrs of F, mli Transform^ 

Specific Activity 
nmoles AMPA/min 
15.04 
7.15 



30 Clone 

pMON7469#l 
pMON7470 
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The proteins encoded by the BglU fragment were 
determined in vivo using a T7 expression system (Tabor and 
Richardson, 1985) following cloning of this fragment into the 
BamKl site in the vector pBlueScript (+) (pMON7471 #1, #2; 
opposite orientations). Test and control plasmids were 
transformed into E. coli K38 containing pGPl-2 (Tabor and 
Richardson, 1985) and grown at 30°C in L-broth (2 ml) with 
ampicillin and kanamycin (100 and 50 ug/ml, respectively) to a 
Klett reading of - 50. An aliquot was removed and the cells 
collected by centrifugation, washed with M9 salts (Miller, 1972) 
and resuspended in 1 ml M9 medium containing glucose at 
0.2%, thiamine at 20 ug/ml and containing the 18 amino acids 
at 0.01% (minus cysteine and methionine). Following 
incubation at 30°C for 90 minutes, the cultures were 
transferred to a 42°C water bath and held there for 15 minutes. 
Rifampicin (Sigma) was added to 200 ug/ml and the cultures 
held at 42°C for 10 additional minutes and then transferred to 
30°C for 20 minutes. Samples were pulsed with 10 uCi of 
35S-methionine for 5 minutes at 30°C, the cells collected by 
centrifugation and suspended in 60-120 ul cracking buffer (60 
mM Tris-HCl 6.8/1% SDS/1% 2-mercaptoethanol/10% 
glycerol/0.01% bromophenol blue). Aliquot samples were 
electrophoresed on 12.5% SDS-PAGE and following soaking for 
60 minutes in 10 volumes of Acetic Acid-Methanol-water 
(10:30:60), the gel was soaked in ENLIGHTNING ™ (DUPONT) 
following manufacturer's directions, dried, and exposed at 
-70"°C to X-Ray Film. Proteins labeled using ^-methionine 
were detected only for the Bglll-to-Xhol direction, the largest 
about 45 kd in size. When the Bglll-Xhol fragment was 
examined following cloning into the BamUl-Xkol sites of 
pBlueScript (to form pMON7472), this -45 kd protein was still 
expressed. 
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The effect of expression of the glyphosate-to-AMPA 
activity on glyphosate tolerance of E. coli was determined 
initially by examining the growth of recombinants in media 
containing inhibitory concentrations of glyphosate. The test 

5 compared the growth of E. coli JM101 containing a control 
vector (pUC118; Viera and Messing, 1987) or the pUC118 clones 
of the 2.5 kb BglU fragment (pMON7469 #1, #4). There was a 
very clear correlation between the glyphosate-utilization ability 
and glyphosate tolerance. This tolerance phenotype (resistance 

10 to 15 mM glyphosate) was then employed as a screen to quickly 
monitor for the phenotype of deletion clones such as pMON7470 
(Bglll-Xhol 1.8 kb in pUCH8; formed from pMON7469 #1 by 
deletion of the - 700 b^Xhol-Sall fragment) and later clones. 

15 Nucleotide Sequence of the Structural filvphnafltft 

Qadjasdafitaae fiftnf 

The nucleotide sequence of the Bglll-Xhol fragment 
(SEQ ID NO:3) was determined using single-stranded DNA 
templates (generated .using the phagemid clones and the 

20 "helper" M13 phage R408) and the commercially available 
SEQUENASE ™ (International Biotechnologies, Inc.) kit. 
Computer analysis of the sequence (SEQ ID NO:3) revealed a 
single large open reading frame (ORF) in the BglU to Xhol 
direction and is presented in Figure 2 which includes the 

25 location of some of the relevant restriction sites. The putative 
stop codon (UAA) was located 2 bp 5' of the Seal restriction cut 
site. Data to confirm that this UAA codon was the termination 

« 

codon of the -45 kd ORF were derived as follows: previously the 
3' limits had been determined, based on the glyphosate 
30 utilization phenotype, to be between the Sad site (95 bp 
upstream of the Seal site) and the Xhol site. When the 
Bglll-Scal fragment was cloned into the BaroHI-Sroal sites of 
pBlueScript and the proteins expressed in vivo, the -45 kd 
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protein was still produced. The BgllhScal fragment was then 
recloned from this pBlueScript clone as Xbal-HindHI into 
pUCH8 Xbal-Hindni and was found to confer resistance to 15 
mM glyphosate to E. coli JM101 transformants. These data 
located the C-terminus of the --45 kd protein between the Sad 
and Seal sites. The only stop codon, in any reading frame, 
between these sites is that immediately upstream of the Seal 
site. 

There were two methionine codons (AUG; located at 
positions 120 and 186) that if used as the fMet would give rise to 
-proteins of 46.140 and 44.002 kd, respectively, but neither was 
preceded by a clearly recognizable Shine-Dalgarno sequence. 

The start of the protein was delineated more precisely 
as follows: BglU restriction site recognition sequences were 
introduced at positions upstream of the two potential start 
codons by site-directed mutagenesis of pMON7470, substituting 
AGATCT for the sequences AGACTG ("Bgl20 M ) and GTATGC 
("Bgl86"), 21 and 9 bp upstream of the AUG120 and AUG 186, 
respectively. Except where noted, oligonucleotide primers for 
mutagenesis comprised the sequences to be altered flanked by 8- 
10 homologous bases on each side. The glyphosate tolerance 
was determined for the mutated clones. Introduction of the 
BglU site upstream of AUG120 had no effect on glyphosate 
tolerance while it was abolished by the mutagenesis that 
introduced the BglU site upstream of AUG186. The effects of 
these mutageneses on the ~ 45 kd protein were examined by 
subcloning the mutated sequences into T7 expression vectors 
using a site in the polylinker of pMON7470 (Kpnl), just 
upstream of the original BglU site, and the downstream 
HindJU site. This complete fragment was recloned into 
pl8UT3T7 (PHARMACIA) Kpnl-HindUI and tested in vivo as 
described above. The ~ 45 kd protein was still expressed and a* 
comparable levels from both of the "BglU" mutagenized 
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sequences. When the new BglU sites were used as 5* ends (and 
the downstream HindUI site) for cloning into the pBlueScript 
BamUI-HindUI sites, the -45 kd protein was still expressed 
when the new BglU site upstream of AUG120 served as 5' end, 
but not when that located upstream of AUG186 was the 5' end. 
These data suggest strongly that the AUG120 (or some codon 
located very close to it) is the N-terminus of the glyphosate oxido- 
reductase protein. The BglU site introduced upstream of the 
AUG186 did not result in a prematurely terminated or highly 
unstable protein and suggests that the predicted coding 
sequence changes resulting from this mutagenesis 
(Val 18 -Cysi9 -> Argia-Alaig) had severe effects on the activity of 
the enzyme. 

Further data to confirm the location of the 
N-terminus were obtained by introducing separately (by 
mutageneses of pMON7470), an Ncol restriction site 
recognition sequence (CCATGG for CTATGT; changes the 
second codon from Serine to Alanine) or an Ndel sequence 
(CATATG for CCTATG) at AUG120 and expressing this ORF 
using efficient E. coli expression vectors. The expression of the 
Ndel version is outlined here: the Ndel-HindUI fragment, 
beginning at the putative AUG, was cloned into pMON2123 
(Ndel-HindUl) replacing the ompF-IGF-1 fusion fragment 
(Wong et at, 1988). The resultant clone was introduced into 
E. coli JM101 and the cells induced with nalidixic acid as 
described (Wong et al., 1988) for 2 hours. The resultant protein 
was indistinguishable in size from the -45 kd protein on SDS 
PAGE and a cell lysate from an induced culture had a 
glyphosate oxidoreductase specific activity of 12.8 nmoles 
AMPA/min.mg. When compared in a separate experiment, no 
differences were observed for the glyphosate oxidoreductase 
activity when the second codon was Alanine instead of Serine. 
The structural DNA sequence for the glyphosate oxidoreductase 
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enzyme (SEQ ID N0:4) begins at nucleotide 120 and ends at 
nucleotide 1415 of the Zfc/II-XhoI fragment of Figure 2 and the 
glyphosate oxidoreductese enzyme consists of 431 amino acids 
(SEQIDNO:5). 

Construction of Glvnhosate Oxidoreductase Plant Opha 
Transformation Vectors 

To facilitate the manipulation of the structural 
glyphosate oxidoreductase gene, the internal JUcoRI and JVcol 
restriction site recognition sequences were removed by site- 
directed mutagenesis to substitute the sequence GAATTT for 
GAATTC and CCACGG for CCATGG, respectively. A 
glyphosate oxidoreductase coding sequence suitable for 
introduction into and expression in plant transformation 
vectors was assembled in the following way: the Mrol ("Met-Ala- 
n ) N-terminus was combined with the iVcol- and JScoRI-deleted 
coding sequences, and the C-terminus deleted to the Seal site, 
in a number of cloning steps using the internal Sphl and 
EcoRV restriction sites. In these steps a BglJl site was located 
immediately upstream of the Ncol site and EcoBI and ifindlll 
sites were located immediately downstream from the stop 
codon. The sequence of this manipulated glyphosate 
oxidoreductase gene (SEQ ID NO:6) is shown in Figure 3. The 
manipulated glyphosate oxidoreductase gene still codes for the 
wild-type glyphosate oxidoreductase protein. The 
manipulations do not alter the amino acid sequence of the 
glyphosate oxidoreductase. This glyphosate oxidoreductase 
structural sequence (SEQ ID NO:6), as a BgllUNcoh- 
EcoRI/HindUL fragment of 1321 bp, is readily cloned into an 
appropriate plant expression cassette. This glyphosate 
oxidoreductase gene (SEQ ID NO:6) was cloned as a B^ZII- 
EcoRI fragment into the plant transformation and expression 
vector pMON979 to form pMON17073. 
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Modification and Resvnthesis of th* Olyphosate Oridoreductase 
Gene Sequence 

The glyphosate oxidoreductase gene from LBAA 
contains sequences that could be inimical to high expression of 
the gene in plants. These sequences include potential 
polyadenylation sites that are often A+T-rich, a higher G+C% 
than that frequently found in plant genes (56% versus -50%), 
concentrated stretches of G and C residues, and codons that are 
not used frequently in plant genes. The high G+C% in the 
glyphosate oxidoreductase gene has a number of potential 
consequences including the following: a higher usage of G or C 
than that found in plant genes in the third position in codons, 
and the potential to form strong hair-pin structures that may 
affect expression or stability of the RNA. The reduction in the 
G+C content of the glyphosate oxidoreductase gene, the 
disruption of stretches of G*s and C's, the elimination of 
potential polyadenylation sequences, and improvements in the 
codon usage to that used more frequently in plant genes, could 
result in higher expression of glyphosate oxidoreductase in 
plants. 

In the first phase of this experiment, selected regions 
of the gene were modified by site-directed mutagenesis. These 
modifications were directed primarily (but not exclusively) at 
reducing the G+C% and at breaking up some of the G+C 
dusters. The manipulated glyphosate oxidoreductase gene was 
first recloned into the phagemid vector pMON7258 as a Ncoh 
HindUl fragment to form pMON17014. Single stranded DNA 
was prepared from a dut ung E. coli strain. Seven regions of 
the gene were modified by site-directed mutagenesis using the 
primers listed in Table IV and the Bio Rad mutagenesis kit 
(Catalog #170-3576) and following the protocols provided with 
this kit. 
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For the sake of clarity, the reverse complement of the 
actual primers is presented. The base positions, in the 
sequences presented in Figure 2 and in Figure 3, corresponding 
to the primers are indicated by the first and second set of 
5 numbers, respectively. 



TABLE IV - Primers to Modify thq Qlvnhoaate 0*iiWrf,,r^gA 
Gene Coding Semienre 



10 PRIMER 1 (149-210; 38-99) 

CGCTGGAGCT GGAATCGTTG GTGTATGCAC TGCTTTGATG CTTCAACGTC 

gtggattcaa ag (SEQ ID NO:27) 



PRIMER 2 (623-687; 512-576) 

15 GCAGATCCTC TCTGCTGATG CTTTGCGTGA TTTCGATCCT AACTTGTCGC 
ATGCTTTTAC CAAGG (SEQ ID NO:28) 

PRIMER 3 (792-832; 681-721) 

GTCATCGGTT TTGAGACTGA AGGTCGTGCT CTCAAAGGCA T (SEQ ID NO:29) 



20 



L4 (833-901; 722-790) 

TACAACCACT AACGGTGTTC TGGCTGTTGA TGCAGCTGTT GTTGCAGCTG 
GTGCACACTC TAAATCACT (SEQ ID NO:30) 



PRIMER fi (1031-1091; 920-980) 

GGAAATGGGT CTTCGTGTTG CTGGTACTGT TGAGTTTGCT GGTCTCACAG 

ctgctcctaa c (SEQ ID NO:31) 

# 

PRIMER 6 (1 179-1 24fi; 1068-1135) 

TGGATGGGTT ttcgtcctag cattcctgat tctcttccag tgattggtcg 
TGCAACTCGT ACACCCGA (SEQ ID NO:32) 
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EBIMEEJ (1247-1315; 1136-1204) 

CGTAATCTAT GCTTTTGGTC ACGGTCATCT CGGTATGACA GGTGCTCCAA 
TGACTGCAAC TCTCGTCTC (SEQ ID NO:33) 

The resultant gene (SEQ ID NO:7) was confirmed by 
sequencing and by the ability to provide comparable glyphosate 
tolerance levels as the manipulated glyphosate oxidoreductase 
gene control This modified gene (SEQ ID NO:7) is referred to 
as "modified glyphosate oxidoreductase." The G+C% of the 
glyphosate oxidoreductase gene (SEQ ID NO:6) was reduced 
from -56% in the manipulated version to -52% in the modified 
version (SEQ ID NO:7). A comparison of the manipulated and 
modified glyphosate oxidoreductase gene is shown in Figure 3, 
with the manipulated version on top and the changes 
introduced to make the modified version on the bottom. This 
modified glyphosate oxidoreductase gene was cloned as a BgllL- 
EcoBI fragment into a plant expression cassette comprising the 
En-CaMV35S promoter and the NOS 3' sequences. This 
cassette was then cloned as a NotI fragment into the pMON886 
vector to form pMON17032 (Figure 5). 

A synthetic glyphosate oxidoreductase gene (SEQ ID 
NO:8) was designed to change as completely as possible those 
inim ica l sequences discussed above. In summary, the gene 
sequence was redesigned to eliminate as much as possible the 
following sequences or sequence features (while avoiding the 
introduction of unnecessary restriction sites): stretches of G's 
and C's of 5 or greater; A+T rich regions (predominantly) that 
could function as polyadenylation sites or potential RNA 
destabilization region, and codons not frequently found in plant 
genes. A comparison of the manipulated (SEQ ID NO:6) and 
synthetic (SEQ ID NO:8) glyphosate oxidoreductase genes is 
presented in Figure 4, with the manipulated gene (SEQ ID 
NO:6) on top and the differences introduced into the synthetic 
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gene (SEQ ID N0:8) on the bottom. The G+C% for the synthetic 
glyphosate oxidoreductase gene is -51% and the potential to 
form short, high energy, hair-pin structures is reduced. This 
synthetic gene was cloned as a Bglll-EcoRI fragment into 
pMON979 to form pMON17065 for introduction into plants. 

Expression of Chloroplast Directed Glyphosate Oxidoreductase 
The glyphosate target in plants, the 5-eno/pyruvyl- 
shikimate-3-phosphate synthase (EPSPS) enzyme, is located in 
the chloroplast. Although glyphosate oxidoreductase activity 
located in the cytoplasm reduces/prevents glyphosate from 
reaching the chloroplast in the transgenic plant, directing the 
glyphosate oxidoreductase enzyme to the chloroplast has been 
found to further minimize the effects of glyphosate on EPSP 
synthase. Many chloroplast-localized proteins are expressed 
from nuclear genes as precursors and are targeted to the 
chloroplast by a chloroplast transit peptide (CTP) that is 
removed during the import steps. Examples of such 
chloroplast proteins include the small subunit (SSU) of Ribulose- 
l,5-bi8phosphate carboxylase (RUBISCO), 5-eno/-pyruvyl- 
shikimate-3-phosphate synthase (EPSPS), Ferredoxin, 
Ferredoxin oxidoreductase, the Light-harvesting-complex 
protein I and protein II, and Thioredoxin F. It has been 
demonstrated in vivo and in vitro that non-chloroplast proteins 
may be targeted to the chloroplast by use of protein fusions with 
a CTP and that a CTP sequence is sufficient to target a protein 
to the chloroplast (della-Cioppa et al., 1987). 

The glyphosate oxidoreductase protein was targeted 
to the chloroplast by construction of a fusion between the 
C-terminus of a CTP and the N-terminus of glyphosate 
oxidoreductase. In the first example, a specialized CTP, 
derived from the SSU 1A gene from Arabidopsis thaliana 
(Timko et al., 1988) was used. This CTP (designated CTP1) was 
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constructed by a combination of site-directed mutageneses. The 
CTP1 structure (SEQ ID NO:9) (Figure 6) is made up of the SSU 
1A CTP (amino acids 1-55), the first 23 amino acids of the 
mature SSU 1A protein (amino acids 56-78), a serine residue 

5 (amino acid 79), a new segment that repeats amino acids 50 to 
56 from the SSU 1A CTP and the first two amino acids from the 
mature protein (amino acids 80-87), and an alanine and 
methionine residue (amino acids 88 and 89). An Ncol 
restriction site is located at the 3' end (spans the Met codon) to 

10 facilitate the construction of precise fusions to the 5' of 
glyphosate oxidoreductase or other genes. At a later stage, a 
BglU site was introduced upstream of the N terminus of the 
SSU LA sequences to facilitate the introduction of the fusions 
into plant transformation vectors. A fusion was assembled 

15 between the CTP1 (SEQ ID NO:9) and the manipulated 
glyphosate oxidoreductase (SEQ ID NO:6) (through the JVcol 
site) in the pGEM3zf(+) vector to form pMONl7034. This vector 
may be transcribed in vitro using the SP6 polymerase and the 
RNA translated with 35S-Methionine to provide material that 

20 may be evaluated for import into chloroplasts isolated from 
Lactuca sativa using the methods described hereinafter (della- 
Cioppa et al., 1986, 1987). This CTPl-glyphosate oxidoreductase 
fusion was indeed found to be imported into chloroplasts at 
about 9% efficiency of that of the control, 35S labeled PreEPSPS 

25 (pMON6140; della-Cioppa et al., 1986). A CTPl-glyphosate 
oxidoreductase fusion was then assembled with the synthetic 
glyphosate oxidoreductase gene (SEQ ID NO:8) and this was 
introduced as a Bglll-EcoBl fragment into plant vector 
pMON979 to form pMON17066 (Figure 7). Following an 

30 intermediate cloning step to acquire more cloning sites, this 
CTPl-glyphosate oxidoreductase fusion was also cloned as a 
Xbal-BamHI site into pMON981 to form pMON17138 (Figure 8). 
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In the second example, a CTP-glyphosate 
oxidoreductase fusion was constructed between the Arabidopsis 
thaliana EPSPS (Klee et aL, 1987) CTP and the synthetic 
glyphosate oxidoreductase coding sequences. The Arabidopsis 

5 CTP was first engineered by site-directed mutagenesis to place 
a Sphl restriction site at the CTP processing site. This 
mutagenesis replaced the Glu-Lys at this location with Cys- 
Met. The sequence of this CTP, designated CTP2, (SEQ ID 
NO: 10) is shown in Figure 9. The Ncol site of the synthetic 

20 glyphosate oxidoreductase gene (SEQ ID NO:8) was replaced 
with a Sphl site that spans the Met codon. The second codon 
was converted to one for leucine in this step also. This change 
had no apparent effect on the in vivo activity of glyphosate 
oxidoreductase in E. coli. The CTP2-synthetic glyphosate 

15 oxidoreductase fusion was cloned into pBlueScript KS(+) and 
this template was transcribed in vitro using T7 polymerase and 
the 35S-methionine-labeled material was shown to import into 
chloroplasts with an efficiency comparable to that for the CTP1* 
glyphosate oxidoreductase fusion. This CTP2-synthetic 

20 glyphosate oxidoreductase fusion was then cloned as a Xbal- 
B a mill fragment into a plant expression vector to form 
pMON17164. A structural map of this plasmid is presented in 
Figure 12. 

The plant vector portion of pMON17164 (Figure 12) is 
25 composed of the following segments. A chimeric kanamycin 
resistance gene engineered for plant expression to allow 
selection of the transformed tissue. The chimeric gene consists 
of the 0.35 Kb cauliflower mosaic virus 35S promoter (P-35S) 
(Odell et aL, 1985), the 0.83 Kb neomycin phosphotransferase 
30 typell gene (KAN), and the 0.26Kb 3'-non-translated region of 
the nopalinee synthase gene (NOS 3 f ) (Fraley et aL, 1983). A 
0.45 Kb Clal to Dral fragment from the pTil5955 octopine Ti 
plasmid, which contains the T-DNA left border region (Barker 
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et al., 1983) A 0.75 Kb segment containing the origin of 
replication from the RK2 plasmid (ori-V) (Stalker et al., 1981) A 
3.0 Kb Sail to PstI segment of pBR322 which provides the origin 
of replication for maintenance in E. coli (ori-322) and the bom 
site for the conjugational transfer into Agrobacterium 
tume facie ns cells. A 0.93 Kb fragment isolated from 
transposon Tn7 which encodes bacterial 
spectinomycin/streptomycin resistance (Spc/Str) (Fling et al., 
1985), and is a determinant for selection in E. coli and 
Agrobacterium tumefaciens. A 0.36 Kb Pvul to Bell fragment 
from the pTiT37 plasmid, which contains the nopaline-type T- 
DNA right border region (Praley et at, 1985). An expression 
cassette consisting of the 0.6 Kb 35S promoter from the figwort 
mosaic vims (P-FMV) (Gowda et al., 1989), several unique 
cloning sites, and the 0.7 Kb 3' nontranslated region of the pea 
rbcS-E9 gene (E9 3') (Coruzzi et al., 1984, and Morelli et al., 
1985). The GTP2-synthetic glyphosate oxidoreductase fusion 
fragment was cloned into this expression cassette. The 
introduction of this -plasmid into Agrobacterium and 
subsequent plant transformation is described in the Examples 
to follow. 

Those skilled in the art will recognize that various 
chimeric constructs can be made which utilize the functionality 
of a particular CTP to import the contiguous glyphosate 
oxidoreductase enzyme into the plant cell chloroplast. The 
chloroplast import of the glyphosate oxidoreductase can be 
determined using the following assay. 

Chloronlast Uptake Assay 

Intact chloroplasts are isolated from lettuce (Latuca 
sativa, var. longifolia) by centrifugation in Percoll/ficoll 
gradients as modified from Bartlett et al. (1982). The final pellet 
of intact chloroplasts is suspended in 0.5 ml of sterile 330 mM 
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sorbitol in 50 mM Hepes-KOH, pH 7.7, assayed for chlorophyll 
(Arnon, 1949), and adjusted to the final chlorophyll 
concentration of 4 mg/ml (using sorbitol/Hepes). The yield of 
intact chloroplasts from a single head of lettuce is 3-6mg 
chlorophyll. 

A typical 300 ul uptake experiment contained 5 mM 
ATP, 8.3 mM unlabeled methionine, 322 mM sorbitol, 58.3 mM 
Hepes-KOH (pH 8.0), 50 ul reticulocyte lysate translation 
products, and intact chloroplasts from L. sativa (200 ug 
chlorophyll). The uptake mixture is gently rocked at room 
•temperature (in 10 x 75 mm glass tubes) directly in front of a 
fiber optic muminator set at maximum light intensity (150 Watt 
bulb). Aliquot samples of the uptake mix (about 50 ul) are 
removed at various times and fractionated over 100 ul silicone- 
oil gradients (in 150 ul polyethylene tubes) by centrifugation at 
11,000 X g for 30 seconds. Under these conditions, the intact 
chloroplasts form a pellet under the silicone-oil layer and the 
incubation medium (containing the reticulocyte lysate) floats on 
the surface. After centrifugation, the silicone-oil gradients are 
immediately frozen in dry ice. The chloroplast pellet is then 
resuspended in 50-100 ul of lysis buffer (10 mM Hepes-KOH pH 
7.5, 1 mM PMSF, 1 mM benzamidine, 5 mM e-amino-n-caproic 
acid, and 30 ug/ml aprotinin) and centrifuged at 15,000 X g for 
20 minutes to pellet the thylakoid membranes. The clear 
supernatant (stromal proteins) from this spin, and an aliquot of 
the reticulocyte lysate incubation medium from each uptake 
experiment, are mixed with an equal volume of 2X SDS-PAGE 
sample buffer for electrophoresis (see below). 

SDS-PAGE is carried out according to Laemmli (1970) 
in 3-17% (w/v) acrylamide slab gels (60 mm X 1.5 mm) with 3% 
(w/v) acrylamide stacking gels (5 mm X 1.5 mm). The gel is 
fixed for 20-30 minutes in a solution with 40% methanol and 
10% acetic acid. Then, the gel is soaked in ENHANCE™ 
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(DuPont) for 20-30 minutes, followed by drying the gel on a gel 
dryer. The gel is imaged by autoradiography, using an 
intensifying screen and an overnight exposure to determine 
whether the glyphosate oxidoreductase is imported into the 
isolated chloroplasts. 

Alternative Isolation Protocol for Oth er Glyphosate 
Oxidoreductase Structural Genes 

A number of other glyphosate oxidoreductase genes 
have been identified and cloned, including the second LBAA 
glyphosate oxidoreductase gene from the Class II cosmid 
pMON7477. The gene was located, by Southern hybridization, 
on the -23 kb HindllL fragment, discussed in the cloning 
section above, using the first glyphosate oxidoreductase gene as 
a probe. Southern analysis also showed P$fl and Bglll 
hybridizing bands of -3.5 and -2.5 kb, respectively. The Bglll 
fragment from pMON7477 was subcloned into the BamHl site of 
pBlueScript vector. A clone in E. coli JM101 (pMON7482), in 
which the cloned fragment was oriented relative to the lac 
promoter as in pMON7469#l, was induced with IPTG and 
assayed for glyphosate oxidoreductase activity. In this 
experiment a Sp. Act. of -93 nmol/min.mg was obtained. In a 
later experiment, Class I and Class II cosmids were also 
isolated following infection of E. coli JM101 with the same 
packaged cosmid preparation and selection directly for 
glyphosate tolerance at 3*5 mM glyphosate on M9 media. 

A glyphosate oxidoreductase geqe has also been 
subcloned from another microbial isolate, identified originally 
by its ability to utilize glyphosate as a phosphorous source and 
later shown to contain a putative glyphosate oxidoreductase 
gene by hybridization with the LBAA glyphosate oxidoreductase 
gene probe. This gene was cloned initially in a T7 promoter 
cosmid by screening for glyphosate tolerance in E. coli 
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HBlOl/pGPl-2 (Boyer and Rolland-Dussoix, 1969; Tabor and 
Richardson, 1985) on M9 medium containing glyphosate at 3 
mM. The presence of the glyphosate oxidoreductase gene was 
first indicated by a positive hybridization signal with the LBAA 
gene and by its location on a 2.5 kb BglTL fragment. This Bglll 
fragment was cloned into the flaroHI site in pBlueScript 
(pMON17183) and expressed from the lac promoter by addition 
of IPTG. In this experiment a glyphosate oxidoreductase with 
a specific activity of 53 nmoles/min.mg was obtained, 
confirming the isolation of the gene by this strategy. The 
following features have usually been found for these glyphosate 
oxidoreductase genes: the genes are found , (by Southern 
hybridization using full-length glyphosate oxidoreductase gene 
probes) on -2.5 kb Bglll fragments, on -3.5 Pstl fragments, 
contain one EcoBI site within the gene and the genes do not 
contain a Jftndlll site. The following schematic diagram 
illustrates some common features of these genes. 




GOX coding region 



The high degree of similarity of glyphosate 
oxidoreductase genes also suggests another way by which new 
glyphosate oxidoreductase genes may be cloned. The apparent 
conservation of regions flanking the genes and the absence of 
certain restriction sites suggests the use of single-stranded 
oligonucleotide probes to the flanking regions, containing 
restriction sites for Bglll, HindUl, Pstl, BamKl, Ndel, or 
other suitable cloning sites, and PCR (Polymerase Chain 
Reaction; see Erlich, 1989, for complete details on PCR and its 
applications) to amplify a glyphosate oxidoreductase gene 
fragment suitable for cloning. The flanking sequences for 119 
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bp upstream (SEQ ID NO:ll) of the wild-type (LBAA isolate) 
glyphosate oxidoreductase gene and for -290 bp (SEQ ID NO:12) 
downstream of the gene are provided in Figure 2. 

Using this PCR approach, glyphosate oxidoreductase 
genes from a number of sources have been isolated. The 
presence of the glyphosate oxidoreductase activity was 
confirmed by cloning the glyphosate oxidoreductase gene from 
chromosomal DNA prepared from Pseudomonas sp. strain LBr 
(Jacob et aL, 1988) and using primers homologous to the N- and 
C-termini of the LBAA glyphosate oxidoreductase gene and 
containing the following suitable restriction cloning sites: 

5 * -GAGAGACTGT CGACTCCGCG GGAGCATCAT ATG-3 ■ (SEQ ID HO: 13) 
and S'-GAACGAATCC AAGCTTCTCA CGACCGCGTA AGTAC-3 1 (SEQ ID 

no : 14) . Cyclotherm parameters used for these PCR reactions is 
as follows: 

Denature at 94° C for 1 minute; 

Anneal at 60° C for 2 minutes; 

Polymerize at 72° C for 3 minutes, 

30 cycles, no autoextension, linked to 4° C incubation. 
The expected -1.3 kb PCR produced was generated and 
following digestion with Ndel and tfindm, this fragment was 
cloned into pMON2123 for expression of the encoded enzyme. 
The glyphosate oxidoreductase activity was measured as 
described above and the Km for glyphosate was similar to that 
for enzymes from LBAA which is presented supra . 

ozidoisdactaae gens 

Pseudomonas sp. strain LBr 25 

Bacteria isolated from glyphosate process waste 
stream treatment facilities may also be capable of converting 
glyphosate to AMP A. Pseudomonas strains LBAA and LBr are 
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two such examples. Such bacteria may also be isolated de novo 
from these waste treatment facilities. 

A population of bacteria was isolated from a fixed-bed 
immobilized cell column, which employed Mannville R-635 
diatomaceous earth beads, by plating on Tryptone Soy Agar 
(Difco), containing cycloheximide at 100 ug/ml, and incubating 
at 28°C. The column had been run for three months on a waste- 
water feed from the Monsanto Company's Luling, MS, 
glyphosate production plant. The column contained 50 mg/ml 
glyphosate and NH 3 as NH 4 C1. Total organic carbon was 300 
mg/ml and BOD's (Biological Oxygen Demand - a measure of 
"soft" carbon availability) was less than 30 mg/ml. This 
treatment column has been described ( Heitkamp et al., 1990); 
One of the predominant members of this population, identified 
as Agrobacterium sp. strain T10, was found to also grow in 
minimal broth in which the sole carbon source provided was 
glyphosate at 10 mM (this broth was made up as for DF 
medium but with glyphosate substituting for the glucose, 
gluconate and citrate). Chromosomal DNA was prepared from 
this isolate and subjected to the same PCR procedure and with 
the same primers as described above for the strain LBr. A 
fragment of the correct size was generated and cloned into the 
E. eoli expression vector. The glyphosate oxidoreductase 
activity was assayed and the K m for glyphosate also determined: 

BQurce Of gene K Jglvphosate: mM) 

Agrobacterium sp. strain T10 28 

Glyphosate-to-AMPA conversion has been reported 
for many different soils (see Malik et al., 1989 for a review) and 
a number of procedures are available for the extraction of total 
DNA from mixed environment samples such as soil (Holben et 
al., 1988; Steffan and Atlas, 1988; Tsai and Olson, 1991), 
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indicating the possibiKty of cloning glyphosate oxidoreductase 
genes without having to first isolate such a degrading 
microorganism. Of course, the procedure described for the 
cloning of the glyphosate oxidoreductase genes, based on the 
conferring of a glyphosate utilization ability or glyphosate 
tolerance on E. co/t, provides a scheme by which other 
glyphosate oxidoreductase genes and other glyphosate 
metabolizing genes may be cloned, without relying on the 
homology determined for the glyphosate oxidoreductase gene 
described here. It is possible also to enrich for glyphosate 
degrading bacteria, for example, by the repeated application of 
glyphosate to a patch of soil (Quinn et al., 1988, Talbot et al. f 
1984). This enrichment step might be used to increase the ease 
with which glyphosate oxidoreductase genes are recovered 
from soil or other environments. 

Evidence for the presence of the glyphosate 
oxidoreductase gene in soil bacteria and a procedure for the 
isolation of such genes is outlined in the following: A 
population of suitable bacteria was enriched for selection of 
bacteria capable of growing in liquid media with glyphosate (at 
10 mil) as a source of carbon (This medium is made up as 
described for the Dworkin-Poster medium but with the 
omission of the carbon sources and with Pi as a source of P). 
The inoculum was provided by extracting soil (from a recently 
harvested soybean field in Jerseyville, Illinois) and the 
population selected by successive culturing in the medium 
described above at 28°C (cydoheximide was included at 100 
ug/ml to prevent growth of fungi). Upon plating on L-agar 
medium, 5 colony types were identified. Chromosomal DNA 
was prepared from 2 ml L-broth cultures of these isolates and 
the presence of the glyphosate oxidoreductase gene was probed 
using PCR screening. Using the primers 
GCCGAGATGACCGTGGCCGAAAGC (SEQ ID NO: 15) and 
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GGGAATGCCGGATGCTTCAACGGC (SEQ ID N0:16),aDNA 
fragment of the predicted size was obtained with the 
chromosomal DNA from one of the isolates (designated S3). 
The PCR conditions used were as follows: 1 minute at 94°C; 2 
minutes at 40°C; 3 minutes at 72°C; 35 cycles. The DNA 
fragment generated in this way is used as a probe (following 
radiolabeling) to isolate the S3 giyphosate oxidoreductase gene 
candidate from a cosmid bank constructed as described for 
LBAA DNA and greatly facilitates the isolation of other 
giyphosate oxidoreductase genes. The primers used are 
homologous to internal sequences in the LBAA giyphosate 
oxidoreductase gene. The PCR conditions employed allow a fan- 
degree of mismatch in the primers and the result suggests that 
the giyphosate oxidoreductase gene from S3 may not be as 
closely related to the other giyphosate oxidoreductase genes that 
were successfully isolated using the primers to the N- and C- 
termini of the LBAA gene. 

A variety of procedures are available for the isolation 
of genes. Some of these procedures are based on the knowledge 
of gene function that allow the design of phenotypic screens to 
aid in the isolation. Others are based on at least partial DNA 
sequence information that allow the use of probes or primers 
with partial or complete homology, or are based on the use of 
antibodies that detect the gene product. All of these options 
may be applied to the cloning of giyphosate oxidoreductase 
genes. 

Improvement of the Kinetic Properti** nf fflahoaatS 

Prior examples of engineered herbicide resistance by 
enzymatic inactivation of the herbicide have utilized enzymes 
with an ability to bind and metabolize the herbicides much 
more efficiently than -giyphosate oxidoreductase metabolizes 
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glyphosate. The glyphosate oxidoreductase enzyme has a K m for 
glyphosate of 20-30 mM and, as a result, the reaction rate for 
the degradation of glyphosate may be enhanced for optimal 
efficiency in transgenic plants by either lowering the K m or by 
raising the V max . 

Random mutagenesis techniques coupled with 
appropriate selections and/or screens are powerful tools which 
have been used successfully to generate large numbers of 
mutagenized gene sequences and potential variants. The same 
approaches may be used to isolate and to identify glyphosate 
oxidoreductase variants with improved glyphosate degradation 
efficiency. The mutagenesis techniques that may be employed 
include chemical mutagenesis of bacterial cultures containing 
the gene of interest or of purified DNA containing this gene and 
PCR methods used to generate copies of the gene (or portions of 
it) under conditions that favor miaincorporation of nucleotides 
(errors) into the new strand. An example of such a condition 
would be carrying out the PCR reaction in the presence of Mn*+. 

Appropriate in vivo screens for improved variants 
following the mutagenesis could include those for improved 
glyphosate tolerance in E. coli or increased growth on 
glyphosate in Mpu+ strains. For the screen, the glyphosate 
oxidoreductase gene is cloned into a vector containing a weak 
bacterial promoter and/or in a replicon with a low copy 
number. The glyphosate tolerance phenotypes of different 
glyphosate oxidoreductase constructs have been shown to vary 
over a range of glyphosate concentrations and to correlate with 
the level of glyphosate oxidoreductase expression. For examp le, 
under uninduced conditions, P/ac-glyphosate oxidoreductase 
vectors express less glyphosate oxidoreductase than PrecA- 
glyphosate oxidoreductase vectors and also display lower 
glyphosate tolerance. The mutagenized gene fragment is 
cloned into the most suitable vector and the resultant library 
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screened. Variants are selected for their ability to grow at 
glyphosate levels which inhibit growth of the control strain 
containing the parent glyphosate oxidoreductase clone. 
Glyphosate oxidoreductase activity confers on E. coli the ability 
to convert glyphosate to AMPA and, in suitable E. coli strains, 
this AMPA can provide a source of phosphate following 
cleavage of the C-P bond by C-P lyase. Suitable E. coli strains 
are B strains or Mpu+ derivatives of K strains. The glyphosate 
oxidoreductase gene confers minimal growth on glyphosate as 
the sole phosphorus source in strain E. coli JM101 Mpu+ (= 
GB993). The growth rate on glyphosate has been shown to also 
correlate with the glyphosate oxidoreductase expression level. 
The mutagenized glyphosate oxidoreductase gene is cloned into 
the appropriate vector and the variant library screened by 
differential growth rates on plates or by culturing in media 
containing glyphosate as sole phosphorous source. Clones 
which demonstrate faster growth on plates relative to the 
control strain are subsequently re-screened by growth curve 
analysis. 

Glyphosate oxidoreductase variants which have been 
identified in each selection/screen are cloned into a vector for 
high-level expression and subjected to enzyme analysis to 
determine Kn, and V max values for glyphosate. The best 
glyphosate oxidoreductase variants are purified for complete 
kinetic characterization. Glyphosate oxidoreductase variants 
which have been identified with lower Km values and similar or 
higher V max values than wild-type enzyme values are analyzed 
by nucleic arid sequencing to determine the mutation(s). The 
goal in isolating variants would be to increase the kcat/Km ratio 

for glyphosate oxidoreductase-catalyzed glyphosate 
degradation. 

A variant with such improvements was isolated. 
The mutagenesis procedure used was that of Mn++-poisoned 
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PCR and the template was a linearized glyphosate 
oxidoreductase gene plasmid containing the synthetic 
glyphosate oxidoreductase gene (SEQ ID NO:8). The 
oligonucleotide primers used were homologous to regions in the 
vector and fla nkin g the glyphosate oxidoreductase gene. The 
PCR conditions employed were as follows: 1 minute at 94°C, 2 
minutes at 55°C, and 3 minutes at 72°C and with 35 cycles. A 
5:1 ratio of dCTP+dGTP+TTP to dATP was used. The reactions 
contained MnCfe at 125, 250, 375, or 500 *iM. After the reaction, 
the amplified product was recloned into a vector containing a 
weak E. coli promoter. This vector was a pBR327 derivative 
containing the araBAD promoter and suitable cloning sites. 
One hundred colonies from this cloning step were then 
screened in E. coli GB993 for improved glyphosate tolerance and 
utilization phenotypes in media composed of MOPS minimal 
medium with glyphosate and Pi or with glyphosate alone, 
respectively. Growth rates were determined by measuring A550 
over a 96 hour period. Three clones were identified that 
exhibited faster growth rates in these screens. These 
transformants had a 1.5-2.0-fold faster utilization phenotype. 
The glyphosate oxidoreductase gene was recloned into the 
expression vector portion and this phenotype verified. All 
kinetic analysis was performed on crude E.coli lysates. 
Putative glyphosate oxidoreductase variant proteins were 
overexpressed after subcloning the NcoVHindUl variant 
glyphosate oxidoreductase gene into VrecA-genelOL expression 
vector. For overexpression in PrecA-genelOL constructs, GB993 
cells containing the vector were induced at a Klett=110-120 in 
M9 minimal medium with 50 ug/ml nalidixic acid and allowed 
to grow for 2.5 hours at 37°C with vigorous shaking. Cells were 
harvested by centrifugation at 4000g, 5 minutes at 4°C, and 
resuspend in 100 mM Tris-HCl, pH 7.1, 1 mM EDTA, 35 mM 
KC1, 20% glycerol, and'l mM benzamidine at 3ml/g cell pellet 
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Lysates were prepared by breaking the cells in a French press, 
twice, at 1000 psi. Insoluble debris was removed by 
centrifugation at 12000g, 15 minutes at 4°C, and the 
supernatant was de-salted by passing over a PD-10 column 
(Sephadex G-25, Pharmacia). The void volume fraction was 
used as the source of enzyme for kinetic analysis. Protein 
concentrations were determined using the Bio-Rad protein dye- 
binding assay. Time and enzyme concentration courses were 
performed to determine linear ranges. The enzyme assay was 
performed as follows: lysate and glyphosate oxidoreductase mix 
(final concentration = 0.1 M MOPS, 0.01 M Tricine, pH 7.4, 0.01 
mM FAD, 10 mM MgCl 2 ) in a 100 ul reaction were pre- 
incubated at 30°C for 2 minutes prior to the addition of 
glyphosate (analytical grade stock prepared in water adjusted 
to pH 7.0 with NaOH). Ten minutes was determined to be the 
optimal time for the enzyme assay using 10 ug lysate. After 10 
minutes at 30°C with shaking, 0.25 ml dinitophenylhydrazine 
(DNPH) reagent (0.5 mg/ml in 0.5 M HC1) was added and the 
reaction was allowed to proceed for an additional 5 minutes at 
30°C with shaking. A 1.5 M NaOH solution (400^1) was then 
added to the assay mix, and the reaction was continued for 5 
minutes at 30°C with shaking. Enzyme activity was 
determined from the amount of glyoxylate-DNPH adduct 
formed by measuring A 52 o against a standard of glyoxylate. 
Enzyme assays are performed in duplicate on at least two 
different single colony isolates of a putative glyphosate 
oxidoreductase variant. To determine K m and V max , enzyme 
assays were performed over a (0.2-2.0) x K m range of glyphosate 
concentrations. The K m and V max were determined from 
Lineweaver Burk, Eadie-Hofstee and hyperbolic kinetic plots. 
Vmax was estimated after determining the amount of 
immunoreactive glyphosate oxidoreductase protein in lysates by 
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immunoblot analysis as described below. Immunoblot analysis 
was performed following SDS-PAGE and transfer of protein 
from the gel to nitrocellulose at 500 mA in a Hoeffer transfer 
apparatus in 25 mM Tris-HCl, 192 mM glycine containing 0.1% 
SDS and 25% methanol for 1-2 hours. After transfer, the 
nitrocellulose was incubated with 50 mM Tris-HCl, pH7.5, 0.9% 
NaCl, 0.01% Tween 20, 0.02% NaN 3 containing 2% bovine 
serum albumin at room temperature with shaking for at least 
30 minutes. After blocking, the same buffer containing a 
1:25,000 dilution of goat anti-glyphosate oxidoreductase 
antiserum was added and the filter was allowed to shake at 
room temperature for 45 minutes. After incubation with 
primary glyphosate oxidoreductase antibody, the filter was 
washed for 45 glyphosate oxidoreductase minutes in buffer 
without antibody; buffer containing a 1:5000 dilution of rabbit 
anti-goat alkaline phosphatase-conjugated second antibody 
(from Pierce) was added and the filter was incubated for 45 
minutes at room temperature with shaking. The filter was 
then washed in buffer without antibody for 30 minutes prior to 
addition of NBT and BCIP (Promega) to allow color 
development. Immunoreactive glyphosate oxidoreductase 
protein was also quantitated by dot blotting the lysate onto 
nitrocellulose and then processing the filter as described above, 
except that 1281-Protein G was used for detection. The amount 
of glyphosate oxidoreductase protein in lysates was determined 
by counting the dot and comparing the amount of radioactivity 
against a glyphosate oxidoreductase protein standard. One 
variant, v.247, showed a 3-4-fold higher specific activity for 
glyphosate oxidoreductase at 25 mM glyphosate and the 
immunoblot analysis indicated that this was not due to an 
elevated glyphosate oxidoreductase protein level. Subsequent 
assays indicated that this variant had a 10-fold lower K m for 
glyphosate than the wild type glyphosate oxidoreductase. In a 



similar manner the K m for IDA was also determined and these 
data are presented below. 

Kinetic analysis of rtvnhoMte nTidnr e ductasp varj fl n ^ ; 

app Ka (mM) app V m (U/mg) 

Variant film ISA film U2A film JDA 

wild type 27.0 2.8 0.8 0.5 .03 .18 

v.247 2.6 0.7 0.6 0.7 .23 1.0 



The glyphosate oxidoreductase gene from v.247 was 
sequenced (SEQ IDJ*0:17) and five nucleotide changes were 
found. These changes are described in the following as they 
relate to the codons: GCT to GCC (codon 43), no amino acid 
change; AGC to GGC (codon 84), Ser to Gly; AAG to AGG 
(codon 153), Lys to Arg; CAC to CGC (codon 334), His to Arg, 
and CCA to CCG (codon 362), no amino acid change. The 
amino acid sequence of the glyphosate oxidoreductase gene 
from v.247 is presented as SEQ ID NO:18. The importance of 
these different amino acid changes was determined initially by 
recloning the altered regions into wild type glyphosate 
oxidoreductase and determining the effect on glyphosate 
oxidoreductase activity and kinetics. This was accomplished by 
recloning the Ncol-Nhel fragment (contains codon 84), the 
Nhel-ApaLl fragment (contains codon 153), and the ApaLI- 
Hindni fragment (contains codon 334), seperately into the wild 
type gene. These glyphosate oxidoreductase genes were then 
expressed and the kinetic analyses performed. The data are 
presented below and indicate that the change that occured in 
the ApaLI-ffindlll fragment (contains codon 334) was 
responsible solely for the alteration in the enzyme. 
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Kinetic analvsia of domain cunfrHoa 




Clone 




appV m (U/mg) 


m m 


wt (wlw2w3*) 


28.4 


0.65 




V^rk i \VXVi3VO J 


9 1 


0.72 


0.34 


wlv2w3 


23.5 


0.62 


0.026 


wlv2v3 


2.1 


0.6 


ft OQ 


wlw2v3 


2.0 


0.75 


a one 
0.375 


vlw2v3 


2.6 


0.55 


0.21 


vlw2w3 


28.0 


0.75 


0.027 


vlv2w3 


26.7 


0.55 


0.021 


* wl=SER84; w2=LYS153; w3=fflS334 




** vl=GLY84; v2=ARG153; v3=ARG334 





15 This result was confirmed and extended by repeating 

the His to Arg change at codon 334 and introducing other 
specific changes at this residue by site-directed mutageneses. 
The primers used are listed in the following: Aca - cgttctctac 

ACTCGTGCTC GTAA6TTGC (SEQ ID NO: 19); Uta. - CGTTCTCTAC 
20 ACTAAGGCTC GTAAGTTGC (SEQ ID NO: 20); fila. - CGTTCTCTAC 
ACTCAAGCTC GTAAGTTGC (SEQ ID NO: 21) ; and Ala. - CGTTCTCTAC 

actgctgctc gtaagttgc (seq id N0:22) (These sequences are the 
antisense to those actually used). The presence of these 
changes was confirmed by sequencing the mutagenized 
25 glyphosate oxidoreductase genes and a kinetic analysis of the 
expressed glyphosate oxidoreductase enzymes was performed. 
The data are presented in the following and show that a 
number of substitutions are possible at this position and which 
result in an enzyme with altered kinetic properties. 

30 
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Kinetic analysis of glyphosate imrin r e duetasp va riant; 



5 



app Km (mM) 


app V m 


(U/mff) 


V /PC 


Variant GIvd 


IDA 


Glvn 


IRA 


Glvp roA 


wild type 27.0 


2.8 


0.8 


0.5 


.03 .18 


v.247 2.6 


0.7 


0.6 


0.7 


.23 1.0 


ARG 334 2.6 


0.5 


0.6 


0.6 


.23 1.2 


LYS334 9.9 


1.3 


0.7 


0.8 


.07 .62 


GLN334 19.6 


3.5 


0.6 


0.7 


.03 .20 


ALA 334 26.7 


3.5 


02 


0.2 


.007 .057 



10 

Additonal mutageneses were performed to change 
the His334 residue to other amino acids. The primers to 
accomplish this and the new codon are listed in the following: 

15 US. " CTCTACACTATCGCTCGTAAGCTTCTTCCAGC (SEQ ID NO: 24); 

LfiU - CTCTACACTCTGGCTCGTAAGCTTCTTCCAGC (SEQ ID NO: 25) ; and 
Slu - CTCTACACTGAAGCTCGTAAGCTTCTTCCAGC (SEQ ID NO: 26) 

(These sequences are the antisense of those actually used; these 
primers also , add a "silent" HindUI that facilitates the 

2^ identification of the mutagenized progeny from the population). 
The GLU334 variant retains substantial glyphosate 
oxidoreductase activity, while the TRP334, ELE334, and LEU334 
variants retain much less activity. 

Prom the first generation variants, those with the 

25 highest k^t/Kn ratio are preferably subjected to a second round 
of mutagenesis followed by subsequent screening and analysis. 
An= alternative approach would be to construct second 
generation glyphosate oxidoreductase variants by combining 
single point mutations identified in the first generation 

30 variants. 



-52- 



PLANT TRANSFORMATION 

Plants which can be made glyphosate tolerant by 
practice of the present invention include, but are not limited to, 
soybean, cotton, corn, canola, oil seed rape, flax, sugarbeet, 
sunflower, potato, tobacco, tomato, wheat, rice, alfalfa, lettuce, 
apple,. poplar and pine. 

A double-stranded DNA molecule of the present 
invention ("chimeric gene") can be inserted into the genome of 
a plant by any suitable method. Suitable plant transformation 
vectors include those derived from a Ti plasmid of 
Agrobacterium tumefaciens, as well as those disclosed, e.g., by 
Herrera-Estrella (1983), Bevan (1984), Klee (1985) and EPO 
publication 120,516 (Schilperoort et al.). In addition to plant 
transformation vectors derived from the Ti or root-inducing (Ri) 
plasmids of Agrobacterium, alternative methods can be used to 
insert the DNA constructs of this invention into plant cells. 
Such methods may involve, for example, the use of liposomes, 
electroporation, chemicals that increase free DNA uptake, free 
DNA delivery via microprojectile bombardment, and 
transformation using viruses or pollen. 

The pMON979 plant transformation/expression 
vector was derived from pMON886 (described below) by 
replacing the neomycin phosphotransferase typell (KAN) gene 
in pMON886 with the 0.89 kb fragment containing the bacterial 
gentamicin-3-N-acetyltransferase type HI (AAC(3)-III) gene 
(Hayford et al., 1988). The chimeric P-35S/AA(3)-in/NOS 
3' gene encodes gentamicin resistance which permits selection 
of transformed plant cells. pMON979 also contains a 0.95 kb 
expression cassette consisting of the enhanced CaMV 35S 
promoter (Kay et al., 1987), several unique restriction sites, and 
the NOS 3' end (P-En-CaMV35S/NOS 3'). The rest of the 
pMON979 DNA segments are exactly the same as in pMON886. 



-53- 



Plasmid pMON886 is made up of the following 
segments of DNA. The first is a 0.93 kb Aval to 
engineered-£coRV fragment isolated from transposon Tn7 that 
encodes bacterial spectinomycin/streptomycin resistance 
(Spc/Str), which is a determinant for selection in E. coli and 
Agrobacterium tumefaciens. This is joined to the 1.61 kb 
segment of DNA encoding a chimeric kanamycin resistance 
which permits selection of transformed plant cells. The 
chimeric gene (P-35S/KAN/NOS 3*) consists of the cauliflower 
mosaic virus (CaMV) 35S promoter, the neomycin 
phosphotransferase typell (KAN) gene, and the 
3-nontranslated region of the nopaline synthase gene (NOS 3') 
(Fraley et al., 1983). The next segment is the 0.75 kb oriV 
containing the origin of replication from the RK2 plasmid. It is 
joined to the 3.1 kb Sail to Pvul segment of pBR322 (ort322) 
which provides the origin of replication for maintenance in 
E. coli and the bom site for the conjugations^ transfer into the 
Agrobacterium tumefaciens cells. The next segment is the 0.36 
kb Pvul to Bell from pTiT37 that carries the nopaline-type 
T-DNA right border (Fraley et al M 1985). 

The pMON981 plasmid contains the following DNA 
segments: the 0.93 kb fragment isolated from transposon Tn7 
encoding bacterial spectinomycin/streptomycin resistance 
[Spc/Str; a determinant for selection in E. coli and 
Agrobacterium tumefaciens (Fling et al, 1985)]; the chimeric 
kanamycin resistance gene engineered for plant expression to 
allow selection of the transformed tissue, consisting of the 0.35 
kb cauliflower mosaic virus 35S promoter (P-35S) (Odell et al., 
1985), the 0.83 kb neomycin phosphotransferase typell gene 
(KAN), and the 0.26 kb 3'-nontranslated region of the nopaline 
synthase gene (NOS 3') (Fraley et al., 1983); the 0.75 kb origin of 
replication from the RK2 plasmid (oriV) (Stalker et al., 1981); 
the 3.1 kb Sail to Pvursegment of pBR322 which provides the 
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10 



origin of replication for maintenance in E. coli (ori-322) and the 
bom site for the conjugation^ transfer into the Agrobacterium 
tumefaciens cells, and the 0.36 kb Pvul to Bell fragment from 
the pTiT37 plasmid containing the nopaline-type T-DNA right 
border region (Fraley et al., 1985). The expression cassette 
consists of the 0.6 kb 35S promoter from the figwort mosaic 
virus (P-FMV) (Gowda et al., 1989) and the 0.7 kb 3' non- 
translated region of the pea rbcS-E9 gene (E9 3') (Coruzzi et al., 
1984, and Morelli et al., 1985). The 0.6 kb Sspl fragment 
containing the FMV35S promoter (Figure 1) was engineered to 
place suitable cloning sites downstream of the transcriptional 
start site. 

The plant vector was mobilized into the ABI 
Agrobacterium strain. The ABI strain is the A208 
15 Agrobacterium tumefaciens carrying the disarmed Ti plasmid 
pTiC58 (pMP90RK) (Koncz and Schell, 1986). The Ti plasmid 
does not carry the T-DNA phytohormone genes and the strain is 
therefore unable to cause the crown gall disease. Mating of the 
plant vector into ABI was done by the triparental conjugation 
system using the helper plasmid pRK2013 (Ditta et al., 1980). 
When the plant tissue is incubated with the ABI::plant vector 
conjugate, the vector is transferred to the plant cells by the vir 
functions encoded by the disarmed pTiC58 plasmid. The vector 
opens at the T-DNA right border region, and the entire plant 
vector sequence may be inserted into the host plant 
chromosome. The pTiC58 Ti plasmid does not transfer to the 
plant cells but remains in the Agrobacterium. 



20 



25 



30 



PLANT RF.aBMF.P^TTON 

When adequate production of the glyphosate 
oxidoreductase activity is achieved in transformed cells (c: 
protoplasts), the cells (or protoplasts) are regenerated into 
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whole plants. Choice of methodology for the regeneration step 
is not critical, with suitable protocols being available for hosts 
from Leguminosae (alfalfa, soybean, clover, etc.), Umbelliferae 
(carrot, celery, parsnip), Cruciferae (cabbage, radish, rapeseed, 
etc.), Cucurbitaceae (melons and cucumber), Gramineae 
(wheat, rice, corn, etc.), Solanaceae (potato, tobacco, tomato, 
peppers) and various floral crops. See, e.g., Ammirato, 1984; 
Shimamoto, 1989; Fromm, 1990; Vasil, 1990. 

The following examples are provided to better 
elucidate the practice of the present invention and should not be 
interpreted in any way to limit the scope of the present 
invention. Those skilled in the art will recognize that various 
modifications, truncations, etc. can be made to the methods and 
genes described herein while not departing from the spirit and 
scope of the present invention. 

EXAMPLES 

Expression. Activity and Phenotvpe of Glyphosate 
Oxidoreductase in Transformed PlantA 

The transformation, expression and activity of 
glyphosate oxidoreductase, and the glyphosate tolerance 
phenotype imparted to the plants by the glyphosate 
oxidoreductase genes, introduced into Nicotiana tabacum cv. 
"Samsun" and/or Brassica napus cv. Westar using the vectors 
PMON17073, pMON17032, pMON17065, pMON17066, 
pMON17138, and pMON17164, is described, in the following 
exemplary embodiments. Initial data in tobacco on the 
expression of the manipulated glyphosate oxidoreductase gene 
(SEQ ID NO:6) under the control of the En-CaMV35S promoter 
(see data on pMON17073 in Tables VIII and DC, for example) 
indicated only low levels of expression of glyphosate 

m 

oxidoreductase. The transcription of the gene was confirmed in 
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the case of 3-4 plants by Northern and Si analysis but no 
glyphosate oxidoreductase protein could be detected (limit of 
detection in that assay was -0.01% expression level). Analysis 
of Ro plants following spray with 0.4 lb/acre (approximately 
0.448 kg/ha) glyphosate also showed only low levels of tolerance. 
Modification of the gene sequence (as described herein) resulted 
in improved expression in tobacco, as did the use of the FMV 
promoter and the use of a CTP fusion to the glyphosate 
oxidoreductase gene. For these reasons the majority of the data 
presented comes from transgenic plants derived using vectors 
containing these improved glyphosate oxidoreductase 
constructs. One set of experiments with the modified 
glyphosate oxidoreductase vector pMON17032 are presented in 
example 1 and a study of manipulated glyphosate 
oxidoreductase, synthetic glyphosate oxidoreductase, and CTP1- 
synthetic glyphosate oxidoreductase is presented in example 2. 
The transformation and expression of glyphosate 
oxidoreductase in canola is described in examp le 3. 

Example 1 

The tobacco leaf disc transformation protocol employs 
healthy leaf tissue about 1 month old. After a 15-20 minute 
surface sterilization with 10% Clorox plus a surfactant, the 
leaves were rinsed 3 times in sterile water. Using a sterile 
paper punch, leaf discs are punched and placed upside down 
on MS104 media (MS salts 4.3 g/1, sucrose 30 g/1, B5 vitamins 
500X 2 ml/1, NAA 0.1 mg/l, and BA 1.0 mg/1) for a 1 day 
preculture. 

The discs were then inoculated with an overnight 
culture of disarmed Agrobacterium ABI containing the subject 
vector that had been diluted 1/5 (ie: about 0.6 OD). The 
inoculation was done by placing the discs in centrifuge tubes 
with the culture. After-30 to 60 seconds, the liquid was drained 
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off and the discs were blotted between sterile filter paper. The 
discs were then placed upside down on MS104 feeder plates 
with a filter disc to co-culture. 

After 2-3 days of co-culture, the discs were 
transferred, still upside down, to selection plates with MS104 
media. After 2-3 weeks, callus formed, and individual clumps 
were separated from the leaf discs. Shoots were cleanly cut 
from the callus when they were large enough to distinguish 
from stems. The shoots were placed on hormone-free rooting 
media (MSO: MS salts 4.3 g/1, sucrose 30 g/1, and B5 vitamins 
500X 2 ml/1) with selection. Roots formed in 1-2 weeks. Any leaf 
callus assays are preferably done on rooted shoots while still 
sterile. Rooted shoots were placed in soil and were kept in a 
high humidity environment (ie: plastic containers or bags). 
The shoots were hardened off by gradually exposing them to 
ambient humidity conditions. 

A total of 45 Kanamycin resistant pMON17032 tobacco 
lines were examined (Table V). 

TABLE V - Expression of Modified Ohrnhnsate 0*iHnr»diiH* a p 

Gene in Tobacco 

(Rl Transgenics of pMON17032) 



fcELanta 



45 



Glyphosate Recallusing 
(0,5mM fiivphoflflte) 

+ 4/- 

0 11 34 



Western Analvsia of Plants* 



' 24 



+ 



21 



*+ means 0.5 - 2 ng/50 \xg protein 
- means <0.5 ngftO \ig protein 
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Leaf recallusing on plant tissue culture media 
indicated a low level of glyphosate tolerance (rated as a +/- 
phenotype) for at least 11 of these lines. At least 24 of these lines 
expressed a detectable level of glyphosate oxidoreductase in the 
range of 0.5 to 2 ng per 50 ug of extractable protein. The 
glyphosate tolerance displayed in the leaf recallusing assay and 
the higher glyphosate oxidoreductase expression level indicate 
that the changes made to the glyphosate oxidoreductase coding 
sequences to make the modified glyphosate oxidoreductase gene 
(SEQ ID NO:7) had a marked-effect on the ability of this gene to 
be expressed in plants. This same effect could also then be 
achieved by expressing the manipulated glyphosate 
oxidoreductase gene (SEQ ID NO:6) using stronger plant 
promoters, using better 3' polyadenylation signal sequences, 
optimizing the sequences around the initiation codon for 
ribosome loading and translation initiation, or by combinations 
of these or other expression or regulatory sequences or factors. 
The Rl progeny of a number of these lines, including those with 
the highest glyphosate oxidoreductase expression level (#'s 
18854 and 18848) were sprayed with glyphosate at rates of 0.4 
and 1.0 lb/acre (0.448 and 1.12 kg/ha, respectively) and 
vegetative performance rated over a period of four weeks (Table 
VI). 
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TABLE VI - Tobacco Spray Data for nMON1 7032 R1 Plant* 

Vegetative fimre * 



5 



10 



15 



20 



Line 


Rate 
kg/ha 


7 Days 


14Davs 


28 Days 


loooU 


A A AO 

0.448 


3 


3 


4 




1 10 
LIZ 


1 


1 


2 


1 CQy40 


0.448 


4 


6 


8 




L12 


2 


3 


6 


1 QQ>f O 

loo4o 


0.448 


3 


4 


8 




1 10 

1.12 


2 


2 


6 


looo4 


0.448 


4 


7 


9 




1.12 


2 


5 


8 


18858 


0.448 


3 


4 


6 




1.12 


1 


2 


4 


18885 


0.448 


4 


5 


8 




1.12 


2 


1 


2 


18890 


0.448 


3 


6 


7 




L12 


1 


2 


3 


Samsun 


0.448 i 


1 


1 


2 




L12 


1 


1 


0 



* Vegetative Score 



OsDead 

10 = No detectable effect 

Following an initial lag, and especially for those 
plants expressing the highest levels of glyphosate 
oxidoreductase, these lines showed vegetative glyphosate 
tolerance at both spray rates (that improved with time). 
Glyphosate ozidoreductase enzyme activity was determined for 
two of the pMON17032 lines (#'s 18858 and 18881). Leaf tissue 
(lg) was harvested, frozen in liquid N2, and stored at -80°C prior 
to extraction. For extraction, leaf tissue was pulverized in a 
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mortar and pestle with liquid N2. To the powdered leaf tissue 
was then added 1 ml extraction buffer (100 mM TrisCl, pH 7.4, 1 
mM EDTA, 20% glycerol, 35 mM KC1, 1 mM benzamidine HC1, 
5 mM Na ascorbate, 5 mM dithiothreitol, and 1 mg/ml bovine 
serum albumin, 4°C), and the sample was further ground for 1 
minute. The resulting mixture was centrifuged for 5 minutes 
(high speed, Eppendorf) and the supernatant was treated with a 
saturated ammonium sulfate solution to give 70% final 
saturation (2.33 ml saturated solution/ml extract). The 
precipitated protein was collected by centrifugation as above, 
and the pellet was resuspended in 0.4 ml of extraction buffer. 
After centrifuging again to remove particulate matter, the 
sample was desalted using Sephadex G50 contained in a 1 ml 
syringe, equilibrated with extraction buffer, according to the 
method of Penefsky (1979). The desalted plant extracts were 
stored on ice, and protein concentrations were determined by 
the method of Bradford (1976). Glyphosate oxidoreductase 
reactions were carried out in duplicate for 60 minutes at 30°C in 
an assay mixture of 0.1 MOPS/0.01 tricine buffer, pH 7.4, 
containing 10 mM MgCl 2 , 0.01 mM flavin adenine dinucleotide 
(FAD, Sigma), and 1 mM ubiquinone Qo, (Sigma). Plant 
extracts (75 ul) were preincubated in the assay mixture for 2 
minutes, and reactions were then initiated by adding 
iminodiacetic acid (IDA, 20 ul) substrate to a final 
concentration of 50 mM (total assay volume was 0.2 ml). 
Reactions were quenched and derivatized as' described below. 
Control reactions omitting IDA and omitting plant extract were 
also performed. Glyoxylate detection was carried out using 2,4- 
dinitrophenylhydrazine (2,4-DNPH) derivatization and reverse 
phase high performance liquid chromotography (HPLC), using 
a modification of the method of Qureshi et al. (1982). 
Glyphosate oxidoreductase reactions (0.2 ml) were quenched 
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with 0.25 ml of DNPH reagent (0.5 mg/ml DNPH [Aldrich] in 
0.5 M HC1) and allowed to derivatize for 5 minutes at 25°C. The 
samples were then extracted with ethyl acetate (2 x 0.3ml) and 
the combined ethyl acetate extracts were extracted with 10% 
Na 2 CC>3 (0.3 ml). The Na 2 C0 3 phase was then washed once 
with ethyl acetate (0.2 ml) and the Na2C(>3 phase injected (100 
ul) on a Beckman Ultrasphere C18 IP HPLC column (5 u, 4.6 
mm x 25 cm) using an LKB GTi binary HPLC system with a 
Waters 990 photodiode array UV/VEJ HPLC detector, via a 
Waters WISP HPLC autoinjector. The isocratic mobile phase 
was methanol-water-acetic acid (60:38.5:1.5) with 5 mM 
tetrabutylammonium phosphate (Pierce). The DNPH- 
glyoxylate peak (retention time = 6.7 minutes) was detected at 
365 nm and compared to a glyoxylate standard (Sigma, 20 uM 
in 0.2 ml) derivatized in exactly the same manner. 

TABLE VII - Glvphoaate oxidoreductase Activity of Transcrpnir 

Tobacco Plants 

Plant Specific Activity 



Example 2 

A series of transformed tobacco lines were derived 
using the "isogenic" glyphosate oxidoreductase vectors 
pMON17073 (manipulated glyphosate oxidoreductase) (SEQ ID 
NO:6), pMON17065 (synthetic glyphosate oxidoreductase) (SEQ 
ID NO:8), and pMO*N17066 (CTPl-synthetic glyphosate 



nmol/min mg 



Samsun 

18881 

18858 



0 (not detectable) 

0.039 

0.018 
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oxidoreductase). By Western analysis (see Table VII below) of a 
number of these lines, the manipulated glyphosate 
oxidoreductase plants were found to express up to -0.5 ng 
glyphosate oxidoreductase per 50 ug plant protein, the synthetic 
glyphosate oxidoreductase at levels from -0.5 - 2 ng per 50 ug, 
and at levels from -2 - 20 ng per 50 ug for the CTPl-synthetic 
glyphosate oxidoreductase plants. 

10 



30 
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TABLE VTTT • Gbahoaate Qadflredackas 

Expression i n Tnbar.cn 

Qsmsima Elanil Wsstsin Bating 
pMON17073 21270 o 
(manipulated) 21281 o 
21286 i 



21929 i 

pMON17066 21237 i 

(CTP1- 21830 o 

synthetic) 21845 3 

21872 3 

21889 1 

21891 0 

pMON17065 21199 0 

(synthetic) 21208 2 

21211 2 

21217 0 

21218 2 
21792 1 
21795 0 
21811 2 



Western rating scale per 50 ug of protein: 

0 -jio detectable glyphosate oxidoreductase 

1 - <.5ng 

2 - .5ng - 2ng 
3 ->2ng 

A number of primary transformants R 0 lines, 
expressing manipulated or synthetic glyphosate oxidoreductase 
or CTPl-synthetic glyphosate oxidoreductase, were sprayed 
with glyphosate at 0.4 lb/acre (0.448 kg/ha) and rated as before. 
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TABLE IX ■ Glvnhnsate .^prav Data' pMO^f^ 
(CTPl-Olvrhosate OTiHn rf ^ irtn c f ) 



10 



Ling 

Control A 
Control B 
Control C 



gfiatfim R«Hnp 
0 
0 
0 



(Spray RntP = ft A Ib^age} (ft 41fl Hrftlfl) 

2 Ji 28 (davs aftpr wny ) 

3 0 0 no detectable 
3 10 gtyphosate 

3 11 oxidoreductase 



15 



20 



25 



30 



22933 


1 


3 


1 


0 


(PMON17073) 


22741 


2 


2 


1 


9 


(DMON17065) 


22810 


3 


3 


4 


6 


(PMON17066) 




1 


2 


1 


1 


(pMON17066) 




3 


10 


10 


10 


(PMON17066) 


22844 


3 


10 


10 


10 


(PMON17066) 


22854 


3 


9 


10 


10 


(PMON17066) 


22860 


3 


8 


10 


10 


(PMON17066) 


22880 


1 


3 


2 


9 


(PMON17066) 


22881 


2 


2 


0 


0 


(pMONl7066) 


22886 


3 


9 


10 


10 


(PMON17066) 


22887 


3 


9 


10 


10 


(DMON17066) 



Western rating scale 
(per 50 jig protein) 

0 = no detectable glyphosate oxidoreductase 

1 = <0.5ng 

2 = 0.5- 2ng 

3 = >2ng 

# Vegetative score: 0 = dead; 

10 = no detectable effect 
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The synthetic glyphosate oxidoreductase line 
displayed a response similar to that noted for the modified 
glyphosate oxidoreductase Ri plants, in that there was some 
immediate glyphosate effects that were overcome with time, 
through the metabolism of the herbicide by glyphosate 
oxidoreductase to the derivatives AMPA and glyoxylate. Since 
the target of glyphosate (EPSP synthase) is located in the 
chloroplast, the activity of glyphosate oxidoreductase must be 
reducing the level of glyphosate within this organelle by 
removing the herbicide before it reaches the chloroplast. The 
CTPl-synthetic glyphosate oxidoreductase plants displayed a 
superior glyphosate tolerance in that these plants did not show 
much, if any, immediate glyphosate effects at the treated rate. 
In general, the treated tolerant plants also showed normal 
development, flowering and fertility. 

The CTPl-synthetic glyphosate oxidoreductase plants 
showed a markedly higher level of glyphosate oxidoreductase 
expression than that shown for the other glyphosate 
oxidoreductase constructs. This increased glyphosate 
oxidoreductase level could be due to enhancement of translation 
of the fusion or to sequestering of glyphosate oxidoreductase 
within the chloroplast and leading to a longer protein half-life. 
The higher level of glyphosate oxidoreductase and/or its 
location in the chloroplast can result in higher levels of 
glyphosate tolerance through rapid detoxification of glyphosate 
in the chloroplast. The presence of glyphosate oxidoreductase 
within the chloroplast has been confirmed. Five leaves from 
each of four plants (#22844, 22854, 22886, 22887), shown to be 
Western positive for glyphosate oxidoreductase, were 
homogenized in Waring blender in 0.9 L GR+ buffer (Bartlett, et 
al., 1982) for 3 X 3 seconds at high speed. The homogenate was 
filtered through 4 layers of Miracloth and centrifuged at 6,000 
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rpm in a GS-3 rotor. The pellet was resuspended in 4 ml total of 
GR+ buffer and placed on top of a 40/80% Percoll step gradient 
and spun at 9,500 rpm for 10 minutes. The intact chloroplasts 
Gower band) were washed once with GR- buffer (Bartlett, et al., 
1982) and centrifuged (up to 6,000 rpm with brake off). They 
were then resuspended in 300 ul 50 mM Hepes pH 7.7, 330 mM 
Sorbitol and lysed on ice using by sonication (small probe, 30%-3 
microtip setting x 10 seconds). The debris was pelleted and the 
supernatant passed through a Sephadez G50 column into 50 
mM Hepes, pH 7.5. The soluble protein concentration was 2.4 
mg/ml. The enzyme assays were done as above using both 50 
mM IDA and 50 mM glyphosate as substrates (30 minute 
assays), but without the addition of 1 mM ubiquinone. 

Table IX • GlVPhosatft OridoreHnrtflgP Activity in Un\**+A 
CMorOPlflflt from Transg enic Tnhaiann 



Fiinmrlf ft 

A number of transformed lines of canola have been 
derived with vectors pMON17138 (CTPl-synthetic glyphosate 
oxidoreductase) and pMON17164 (CTP2-syrithetic glyphosate 
oxidoreductase) as follows. 

Plant Material 

Seedlings of Brassica napus cv Westar wert 
established in 2 inch (w 5 cm) pots containing Metro Mix 350. 



Substrate 



Specific Activity 

(nmoles/min.mg) 



Iminodiacetic acid 
Glyphosate 



179 
92 
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They were grown in a growth chamber at 24°C f 16/8 hour 
photoperiod, light intensity of 400 uEm-2 S ec-i (HID lamps). 
They were fertilized with Peters 20-10-20 General Purpose 
Special. After 2 1/2 weeks they were transplanted to 6 inch (~ 15 
cm) pots and grown in a growth chamber at 15/10°C day/night 
temperature, 16/8 hour photoperiod, light intensity of 800 uEm- 
2sec-i (HID lamps). They were fertilized with Peters 15-30-15 Hi- 
Phos Special. 

Transformation/Selec tion/Regeneration 

Four terminal internodes from plants just prior to 
bolting or in the process of bolting but before flowering were 
removed and surfaced sterilized in 70% v/v ethanol for 1 
minute, 2% w/v sodium hypochlorite for 20 minutes and rinsed 
3 times with sterile deionized water. Stems with leaves 
attached could be refrigerated in moist plastic bags for up to 72 
hours prior to sterilization. Six to seven stem segments were 
cut into 5mm discs with a Redco Vegetable Slicer 200 
maintaining orientation of basal end. 

The Agrobacterium was grown overnight on a rotator 
at 24°C in 2mls of Luria Broth containing 50mg/l kanamycin, 
24mg/l chloramphenicol and lOOmg/1 spectinomycin. A 1:10 
dilution was made in MS (Murashige and Skoog) media giving 
approximately 9x10* cells per ml. This was confirmed with 
optical density readings at 660 mu. The stem discs (explants) 
were inoculated with 1.0ml of Agrobacterium and the excess 
was aspirated from the explants. 

The explants were placed basal side down in petri 
plates containing 1/10X standard MS salts, B5 vitamins, 3% 
sucrose, 0.8% agar, pH 5.7, 1.0mg/l 6-benzyladenine (BA). The 
plates were layered with 1.5ml of media containing MS salts, B5 
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vitamins, 3% sucrose, pH 5.7, 4.0mg/l p-chlorophenoxyacetic 
acid, 0.005mg/l kinetin and covered with sterile filter paper. 

Following a 2 to 3 day co-culture, the explants were 
transferred to deep dish petri plates containing MS salts, B5 
vitamins, 3% sucrose, 0.8% agar, pH 5.7, lmg/1 BA, 500mg/I 
carbenicillin, 50mg/l cefotaxime, 200 mg/1 kanamycin or 
175mg/l gentamicin for selection. Seven explants were placed 
on each plate. After 3 weeks they were transferred to fresh 
media, 5 explants per plate. The explants were cultured in a 
growth room at 25°C, continuous light (Cool White). 

Expression Assay 

After 3 weeks shoots were excised from the explants. 
Leaf recallusing assays were initiated to confirm modification 
of Ro shoots. Three tiny pieces of leaf tissue were placed on 
recallusing media containing MS salts, B5 vitamins, 3% 
sucrose, 0.8% agar, pH 5.7, 5.0mg/l BA, 0.5mg/l naphthalene 
acetic acid (NAA), 500mg/l carbenicillin, 50mg/l cefotaxime 
and 200mg/l kanamycin or gentamicin or 0.5mM glyphosate. 
The leaf assays were incubated in a growth room under the 
same conditions as explant culture. After 3 weeks the leaf 
recallusing assays were scored for herbicide tolerance (callus 
or green leaf tissue) or sensitivity (bleaching). 

Transplantation 

At the time of excision, the shoot stems were dipped 
in Rootone® and placed in 2 inch (- 5 cm) pots containing 
Metro-Mix 350 and placed in a closed humid environment. 
They were placed in a growth chamber at 24°C, 16/8 hour 
photoperiod, 400 uEm-isec-2(HID lamps) for a hardening-off 
period of approximately 3 weeks. 
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The seed harvested from R 0 plants is R a seed which 
gives rise to R x plants. To evaluate the glyphosate tolerance of 
an R 0 plant, its progeny are evaluated. Because an Ro plant is 
assumed to be hemizygous at each insert location, selfing 
results in maximum genotypic segregation in the Rj. Because 
each insert acts as a dominant allele, in the absence of linkage 
and assuming only one hemizygous insert is required for 
tolerance expression, one insert would segregate 3:1, two 
inserts, 15:1, three inserts 63:1, etc. Therefore, relatively few Ri 
plants need be grown to find at least one resistant phenotype. 

Seed from an Ro plant is harvested, threshed, and 
dried before planting in a glyphosate spray test. Various 
techniques have been used to grow the plants for Ri spray 
evaluations. Tests are conducted in both greenhouses and 
growth chambers. Two planting systems are used; ~ 10 cm pots 
or plant trays containing 32 or 36 cells. Soil used for planting is 
either Metro 350 plus three types of slow release fertilizer or 
plant Metro 350. Irrigation is either overhead in greenhouses 
or subirrigation in growth chambers. Fertilizer is applied as 
required in irrigation water. Temperature regimes 
appropriate for canola were maintained. A sixteen hour 
photoperiod was maintained. At the onset of flowering, plants 
are transplanted to -15 cm pots for seed production. 

A spray "batch" consists of several sets of Ri 
progenies all sprayed on the same date. Some, batches may also 
include evaluations of other than R x plants. Each batch also 
includes sprayed and unsprayed non-transgenic genotypes 
representing the genotypes in the particular batch which were 
putatively transformed. Also included in a batch is one or more 
non-segregating transformed genotypes previously identified as 
having some resistance. 
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10 



Two-six plants from each individual R 0 progeny are 
not sprayed and serve as controls to compare and measure the 
glyphosate tolerance, as well as to assess any variability not 
induced by the glyphosate. When the other plants reach the 2-4 
leaf stage, usually 10 to 20 days after planting, glyphosate is 
applied at rates varying from 0.28 to 1.12 kg/ha, depending on 
objectives of the study. Low rate technology using low volumes 
has been adopted. A laboratory track sprayer has been 
calibrated to deliver a rate equivalent to field conditions. 

A scale of 0 to 10 is used to rate the sprayed plants for 
vegetative resistance. The scale is relative to the unsprayed 
plants from the same R 0 plant. A 0 is death, while a 10 
15 represents no visible difference from the unsprayed plant. A 
higher number between 0 and 10 represents progressively less 
damage as compared to the unsprayed plant. Plants are scored 
at 7, 14, and 28 days after treatment (DAT), or until bolting, and 
a line is given the average score of the sprayed plants within an 
20 R 0 plant family. 

Six integers are used to qualitatively describe the 
degree of reproductive damage from glyphosate: 
0: No floral bud development 
2: Floral buds present, but aborted prior to 
25 opening 

4: Flowers open, but no anthers, or anthers fail 

to extrude past petals 
6: Sterile anthers 
8: Partially sterile anthers 
30 10: Fully fertile flowers 

Plants are scored using this scale at or shortly after 
initiation of flowering, depending on the rate of floral structure 
development. 
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Tables X and XI below tabulate the vegetative and 
reproductive scores for canola plants transformed with 
pMON17138 (sprayed at a rate of 0.56 kg/ha and pMON17164 
(sprayed at a rate of 0.84 kg/ha), respectively. The results 
presented below illustrate the glyphosate tolerance conferred to 
canola plants as a result of expression of a glyphostate 
oxidreductase gene in the plants. 



Table X - Qlvnhosate Sprav Evaluation of Canola Plants 
containing pMON1713fi 



Line name 



Batch 



0.56 kg/ha score o.56kg/h fl a™™ 

14 DAT 28 DAT 

Vegetative Reproductive 



15 



20 



17138-22 

1713*30 

17138-145 

17138-158 

17138-164 



80 
77 
79 



79 



79 
79 
79 



9 10 

9 10 

10 10 
8 10 
8 10 
3 0 
1 0 



Untransformed 
Untransformed 



25 
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Table XI - Glynhoaate Sprav Evalua tion of Hannla Elanta 
containing nMON] f\ frf 

0.84 kg/ha «rn rft 

Construct Batch udat 28 pat 

YggetataYS reproductive 



10 



15 



20 



25 



30 



17164-6 


82 

OA 


7 


ID 


17164-9 




Q 
O 


ID 


17164-20 


82 


7 
i 


1U 


17164-25 


83 


Q 
o 


iU 


17164-35 


84 


7 


in 


1716445 


83 


o 


in 

JXJ 


1716461 


83 


7 


in 

JU 


17164-75 


84 


7 


in 


17164-85 


84 


7 


in 


17164-97 


84 


6 




17164-98 


83 


9 


in 


17164-105 


83 


7 


10 


17164-110 


83 


9 


10 


17164-115 


83 


7 


10 


17164-129 


83 


8 


10 


17164-139 


84 


7 


10 


17164-140 


83 


8 


10 


17164-164 


83 


7 


10 


17164-166 


83 


8 


10 


17164-174 


83 


8 


10 


17164-186 


83 


3 


10 


17164-202 


83 


8 


10 


17164-218 


84 


6 


10 


17164-219 


83 


9 


10 


17164-222 


84 


7 


10 
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17164-225 


83 


7 


10 


17164-227 


84 


7 
i 


10 


17164-230 


83 


8 


10 


17164-243 


83 


7 


10 


17164-247 


84 


7 


10 


17164-287 


84 


7 


10 


17164-289 


83 


8 


10 


17164-300 


83 


9 


10 


17164-337 


83 


8 


10 


Example 4 









The glyphosate oxidoreductase gene has also been 
introduced into and expressed in soybean and imparts 
glyphosate tolerance to such plants. The CTP2-synthetic 
glyphosate oxidoreductase fusion gene (as described above) was 
introduced into soybean under the control of the FMV promoter 
and with the NOS 3* sequences in vector pMON17159, a map of 
which is presented in Figure 10. This vector consists of the 
following elements in addition to the glyphosate oxidoreductase 
gene sequences; the pUC origin of replication, an NPTII 
bacterial selectable marker gene (kanamycin) and the beta- 
glucuronidase gene (GUS; Jefferson et a. 1986) under the 
control of the E35S promoter and with the E9 3' sequences. The 
latter gene provides a scorable marker to facilitate the 
identification of transformed plant material. 

Soybean plants are transformed with pMON17159 by 
the method of microprojectile injection using particle gun 
technology as described in Christou et al. (1988). The seed 
harvested from R c plants is Ri seed which gives rise to Rx 
plants. To evaluate the glyphosate tolerance of an R„ plant, its 
progeny are evaluated.. Because an Ro plant is assumed to be 
hemizygous at each insert location, selfing results in 
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maximum genotypic segregation in the R x . Because each 
insert acts as a dominant aUele, in the absence of linkage and 
assuming only one hemizygous insert is required for tolerance 
expression, one insert would segregate 3:1, two inserts, 15:1, 
three inserts 63:1, etc. Therefore, relatively few Rj plants need 
be grown to find at least one resistant phenotype. 

Seed from an Ro soybean plant is harvested, and 
dried before planting in a glyphosate spray test. Seeds are 
planted into 4 inch (~5cm) square pots containing Metro 350. 
Twenty seedlings from each Ro plant is considered adequate for 
testing. Plants are maintained and grown in a greenhouse 
environment. A 12.5-14 hour photoperiod and temperatures of 
30°C day and 24°C night is regulated. Water soluble Peters Pete 
Lite fertilizer is applied as needed. 

A spray "batch" consists of several sets of R x 
progenies all sprayed on the same date. Some batches may also 
include evaluations of other than R x plants. Each batch also 
includes sprayed and unsprayed non-transgenic genotypes 
representing the genotypes in the particular batch which were 
putatively transformed. Also included in a batch is one or more 
non-segregating transformed genotypes previously identified as 
having some resistance. 

One to two plants from each individual R c progeny 
are not sprayed and serve as controls to compare and measure 
the glyphosate tolerance, as well as to assess any variability not 
induced by the glyphosate. When the other plants reach the 
first trifoliolate leaf stage, usually 2-3 weeks after planting, 
glyphosate is applied at a rate equivalent of 128 oz./acre 
(8.895kg/ha) of Roundup®. A laboratory track sprayer has been 
calibrated to deUver a rate equivalent to those conditions. 

A vegetative* score of 0 to 10 is used. The score is 
relative to the unsprayed progenies from the same R 0 plant. A 
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0 is death, while a 10 represents no visible difference from the 
unsprayed plant. A higher number between 0 and 10 
represents progressively less damage as compared to the 
unsprayed plant Plants are scored at 7, 14, and 28 days after 
treatment (DAT). 



10 



15 



20 



Table XJI - Glvnhosate Sprav Evaluation of Snvhean Plant* 
containing nMONI 71 SQ 



Line 


Batch 


17159-24 


14 


17159-25 


14 


17159-28 


14 


17159-40 


14 


1715943 


14 


17159-71 


14 


17159-77 


14 


1715981 


15 


Untransformed 


14 



Score ® fi.fi9fikg/ha. 2fi DAT 
9 
9 
6 
4 
4 
10 
9 
4 
0 



Example g 

The glyphosate oxidoreductase gene has also been 
introduced into Black Mexican Sweet (BMS) corn cells with 
expression of the protein detected in callus. 

Plasmid pMON19632 was used to introduce the 
glyphosate oxidoreductase gene into corn cells. The backbone 
for this plasmid was constructed by inserting the 0.6kb 
cauliflower mosaic virus (CaMV) 35S RNA promoter (E35S) 
containing a duplication of the -90 to -300 region (Kay et al., 
1987), a 0.58kb fragment containing the first intron from the 
maize alcohol dehydrogenase gene (Callis et al., 1987), and the 
3' termination sequences from the nopaline synthase (NOS) 
gene (Fraley et al., 1983) into pUC119 (Yanisch-Perron et al., 
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1985). pMON19632 was formed by inserting the 1.7kb 
Bglll/EcoRI fragment from pMON17064 which contains the 
Arabidopsis SSU CTP fused to the synthetic glyphosate 
oridoreductase coding sequence (SEQ IN NO:8). 

Plasmid pMON19632 was introduced into BMS corn 
cells by co-bombardment with EC9, a plasmid containing a 
sulfonylurea-resistant form of the maize acetolactate synthase 
gene. 2.5 ug of each plasmid was coated onto tungsten particles 
and introduced into log-phase BMS cells using a PDS-1000 
particle gun essentially as described in Klein et al., 1989. 
Transformants were selected on MS medium containing 20ppb 
chlorsulfuron. After initial selection on chlorsulfuron, the calli 
was assayed by glyphosate oridoreductase Western blot 

BMS callus (3 g wet weight) was dried on filter paper 
(Whatman* 1) under vacuum, reweighed, and extraction buffer 
(500 ul/g dry weight; 100 mM Tris, 1 mM EDTA, 10% glycerol) 
was added. The tissue was homogenized with a Wheaton 
overhead stirrer for 30 seconds at 2.8 power setting. After 
centrifugation (3 minutes, Eppendorf microfuge), the 
supernatant was removed and the protein was quantitated 
(BioRad Protein Assay). Samples (50 ug/well) were loaded on 
an SDS PAGE gel (Jule, 3-17%) along with glyphosate 
oridoreductase standard (10 ng), electrophoresed, and 
transferred to nitrocellulose similarly to a previously described 
method (Padgette, 1987). The nitrocellulose blot was probed 
with goat anti-glyphosate oridoreductase IgG, and developed 
with 1-125 Protein G. The radioactive blot was visualized by 
autoradiography. Results were quantitated by densitometry on 
an LKB UltraScan XL laser densitomer and are tabulated below 
in Table XHI. 
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Table XITT - Expression of givnhnsatP aadoisductaafi in FMF 
Corn Callus uang pMOTvnpffffl 



10 



15 



20 



line iStsskaBtsdsacatsM 



EC9 (no GOX) 0 
T13-17 0.016 

T13-16 0.0065 

T13-15 0.016 

T13-14 0.003 

T13-12 0.0079 

T13-7 0.01 

T13-5 0.004 

T13-18 0.026 

T13-8 0.019 

T13-9 0.01 

T134 0.027 

Table XIII illustrates that glyphosate ozidoreductase 
can be expressed and detected in a monocotyledonous plant, 
such as corn. 



Example « 

The glyphosate oxidoreductase gene may be used as a 
selectable marker for plant transformation directly on media 
containing glyphosate. The ability to select and to identify 
transformed plant material depends, in most cases, on the use 
of a dominant selectable marker gene to enable the preferential 
and continued growth of the transformed tissues in the 
presence of a normally inhibitory substance. Antibiotic 
resistance and herbicide tolerance genes have been used almost 
exclusively as such dominant selectable marker genes in the 
presence of the corresponding antibiotic or herbicide. The 
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nptll/kanamycin selection scheme is probably the most 
frequently used. It has been demonstrated that glyphosate 
oxidoreductase is also a useful and perhaps superior selectable 
marker/selection scheme for producing and identifying 
transformed plants. 

A plant transformation vector that may be used in 
this scheme is pMON17226 (Figure 11). This plasmid 
resembles many of the other plasmids described infra and is 
essentially composed of the previously described bacterial 
replicon system that enables this plasmid to replicate in E. coli 
and to be introduced into and to replicate in Agrobacterium, the 
bacterial selectable marker gene (Spc/Str), and located between 
the T-DNA right border and left border is the CTPl-glyphosate 
oxidoreductase synthetic gene in the PMV promoter-E9 3" 
cassette. This plasmid also has single sites for a number of 
restriction enzymes, located within the borders and outside of 
the expression cassette. This makes it possible to easily add 
other genes and genetic elements to the vector for introduction 
into plants. 

The protocol for direct selection of transformed plants 
on glyphosate is outlined for tobacco. Explants are prepared for 
pre-culture as in the standard procedure as described in 
Example 1: surface sterilization of leaves from 1 month old 
tobacco plants (15 minutes in 10% clorox + surfactant; 3X dH 2 0 
washes); explants are cut in 0.5 x 0.5 cm squares, removing leaf 
edges, mid-rib, tip, and petiole end for uniform tissue type; 
explants are placed in single layer, upside down, on MS104 
plates + 2 ml 4C005K media to moisten surface; pre-culture 1-2 
days. Explants are inoculated using overnight culture of 
Agrobacterium containing the plant transformation plasmid 
that is adjusted to a titer of 1.2 X 109 bacteria/ml with 4C005K 
media. Explants are placed into a centrifuge tube, the 
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Agrobacterium suspension is added and the mixture of bacteria 
and explants is "Vortexed" on maximum settting for 25 seconds 
to ensure even penetration of bacteria. The bacteria are poured 
off and the explants are blotted between layers of dry sterile 
filter paper to remove excess bacteria. The blotted explants are 
placed upside down on MS 104 plates + 2ml 4C005K media + 
filter disc. Co-culture is 2-3 days. The explants are transferred 
to MS104 + Carbenicillin 1000 mg/1 + cefotaxime 100 mg/1 for 3 
days (delayed phase). The explants are then transferred to 
MS104 + glyphosate 0.05 mM + Carbenicillin 1000 mg/1 + 
cefotaxime 100 mg/1 for selection phase. At 4-6 weeks shoots are 
cut from callus and placed on MSO + Carbenicillin 500 mg/1 
rooting media. Roots form in 3-5 days, at which time leaf pieces 
can be taken from rooted plates to confirm glyphosate tolerance 
and that the material is transformed. 

The presence of the glyphosate oxidoreductase 
protein in these transformed tissues has been confirmed by 
immunoblot analysis of leaf discs. The data from one 
experiment with pMON17226 is presented in the following: 25 
shoots formed on glyphosate from 100 explants inoculated with 
Agrobacterium ABI/pMON17226; 15 of these were positive on 
recallusing on glyphosate, and 19 of these were positive for 
glyphosate oxidoreductase protein as detected by immunoblot. 
These data indicate a transformation rate of 15-19 per 100 
explants, which makes this a highly efficient and time saving 
transformation procedure for plant. Similar transformation 
frequencies have been obtained with a pMON17226 derivative 
(pMON 17241) containing the gene for the glyphosate 
oxidoreductase v.247 (SEQ ID NO: 17). The glyphosate 
oxidoreductase gene has also been shown to enable direct 
selection of transformants in other plant species, including 
Arabidopsis, potato, and sugarbeet. 
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From the foregoing, it will be seen that this invention 
is one well adapted to attain all the ends and objects 
hereinabove set forth together with advantages which are 
obvious and which are inherent to the invention. 

It will be understood that certain features and 
subcombinations are of utility and may be employed without 
reference to other features and subcombinations. This is 
contemplated by and is within the scope of the claims. 

Since many possible embodiments may be made of 
the invention without departing from the scope thereof, it is to 
be understood that all matter herein set forth or shown in the 
accompanying drawings is to be interpreted as illustrative and 
not in a limiting sense. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Kishore, Ganesh M. 

Barry, Gerard F. 

(ii) TITLE OF INVENTION: Glyphosate Resistant Plants 
(iii) NUMBER OF SEQUENCES: 33 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Thomas P. McBride, Monsanto Co. BB4F 

(B) STREET: 700 Chesterfield Village Parkway 

(C) CITY: St. Louis 

(D) STATE: Missouri 

(E) COUNTRY: USA 

(F) ZIP: 63198 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: McBride, Thomas P. 

(B) REGISTRATION NUMBER: 32706 

(C) REFERENCE/DOCKET NUMBER: 38-21(10533) 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: (314)537-7357 



(2) INFORMATION FOR SBQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 564 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO:l: 
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ATTTAGCAGC ATTCCAGATT GGGTTCAATC AACAAGGTAC CAGCCATATC ACTTTATTCA 60 

AATTGGTATC GCCAAAACCA AGAAGGAACT CCCATCCTCA AAGGTTTGTA AGGAAGAATT 120 

CTCAGTCCAA AGCCTCAACA AGGTCAGGGT ACAGAGTCTC CAAACCATTA GCCAAAAGCT 180 

ACAGGAGATC AATGAAGAAT CTTCAATCAA AGTAAACTAC TGTTCCAGCA CATGCATCAT 240 

GGTCAGTAAG TTTCAGAAAA AGACATCCAC CGAAGACTTA AAGTTAGTGG GCATCTTTGA 300 

AAGTAATCTT GTCAACATCG AGCACCTGGC TTGTGGGGAC CAGACAAAAA AGGAATGGTG 360 

CAGAATTGTT AGGCGCACCT ACCAAAAGCA TCTTTGCCTT TATTGCAAAA GATAAAGCAG 420 

ATTCCTCTAG TACAAGTGGG GAACAAAATA ACGTGGAAAA GAGCTGTCCT GACAGCCCAC 480 

TCACTAATGC GTATGACGAA CGCAGTGACG ACCACAAAAG AATTTTCCCT CTATATAAGA 540 

AGGCATTTCA TTCCCATTTG AAGG 564 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDED NESS: double 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 2: 
ATCATCAGAT ACTAACCAAT ATTTCTC 27 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1689 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

NCATGGACGT CTGATCGAAA TCGTCGTTAC CGCAGCAAGG TAAGGCACGC CGAATTTTAT 60 

CACCTACCGC GAAACGGTGG CTAGGCAGCG AGAGACTGTC GGCTCCGCGG GAGCATCCTA 120 

TGTCTGAGAA CCACAAAAAA GTAGGCATCG CTGGAGCCGG AATCGTCGGC GTATGCACGG 180 
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C0CT0ATOCT TCAGCGCCGC GGATTCAAAG TCACCTTQAT TOACCOGAAC CCTCCTOCCO 340 

AAOOTGGATC OTTTOOOAAT GCOGCATOCT tCAAOGOCTC ATCCCTCOTC CCTAtCTCCA 300 

T0CCOGGAAA CTTOACGAGC GTGCCGAAOT OOCTCCTTGA CCCOATOOOC COTTOTCAAT 360 

CCCOTTCACC TATTTCCAAC CATCATGCCT OOTXOATTOG CTTTCTGTTA CCCOOAACAC 420 

GAAACAAGGT CAAGGAGCAO OCGAAAGCAC TCCOCAATCT CATCAACTCC ACGGTOfcCTC 460 

TCATCAACTC ATTCCOCaAO CAOCCTCATO CGAOCCATCT CATCCOCCAT OAAOOTCATC 540 

TGACCCTATA TCOTGGAGAA CCAOACTTCG CCAAOOACCO CGGAGGTTOG OAACTOCCGC 600 

CTCTCAACOC TOTTCGCACG CAGATCCTCA OCCCCGATOC GTTOCOOCAt TTCCATCCGA 660 

ACTTOTCOCA TGCCTTTACC AACGCCATTC TTAXAGAAGA GAACGCTCAC AOGATTAATC 720 

CCCAACGCCT CCTCACCCTC TTGTTTCGGC CTTtTATCOC OAACOGTGGC GAATTCGTAT 760 

CTGCGCCTO? CATCGCCTTT CAGACIGAAO GTACGGCOCT TAAAOOCATT ACAACCACGA 840 

ACGGCGTTCT GOCCOTTOAT OCAOCCGTTG TCOCAGCCCO CCCACACTCO AAATCACTTG *00 

CTAATTCCCT ACGCGATGAC ATCCOOCTCO ATACCGAACO TGGATATCAT ATCOTCATCO 960 

CGAATCCGCA AOCCOCTCCA CGCATTCCOA CGACOGATGC GTCAGGAAAA TTCATOOCGA 1020 

CACCTATGGA AATOOOGCTT CGCOTOOCGO GTACGGTTGA OTTCGCTOOO CtCACAOCCC 1060 

CTCCTAACTO GAAACOTGCO CATOTCCTCT ATACGCAOGC TCGAAAACTT CTTCCAGCCC 1140 

TCOCGCCTGC GAGTTCTGAA GAACGATA7T CCAAATOGAT GOGGTTCCGO CCGAGCATCC 1200 

CGGATTCGCT CCCOGTGATT GGCCOOGCAA CCCOCACACC CGACOTAATC TATCCTTTCG 1260 

CtCATCOTCA TCTCGGCATG ACAOOOOCGC OOATGACOCC AACGCTCGTC TCAGA&CTCC 1320 

TCGCAGCCOA AAAGACCTCA ATCOACATTT CGCCCTTOGC ACCAAACCQC TTTOGtATTG 1300 

GCAAATCCAA GCAAACGGGT CCOOCAAGW AAGTACTTAC GCOOTCOTGA CTACAOCOCA 1440 

GAGCCOOTOT CAAGATCAAT CTCCACCTCO CAATCACCTC GGAOACOCGA AATCGCOCAA 1500 

ATAGAACAGA TATTAAOGAO TCACOCCCOO AAOCCTTTOO OTCACTACAG TCAGCCGOCC 1660 

» 

CGACCCCCTO GATTCATTCA TGTTTCCGGT CAOCTTCOGA TGAAACCAGA ACOCCAGTCO 1620 

CAGCAATCTC ACOATCTCOT CGATAACCAO GCCAGTCTCO TTCTCCOOAA TTTGCTOGCC 1680 

GTACTCGAO l689 
(2) INFORMATION FOA 8SQ ID HOt4s 

fi) **QUENCK CHAKACnAXtTZC8i 
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(A) LENGTH: 1293 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS: double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1293 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

ATG TCT GAG AAC CAC AAA AAA GTA GGC ATC GCT GGA GCC GGA ATC GTC 48 
Met Ser Glu Asn His Lys Lys Val Gly He Ala Gly Ala Gly He Val 

1 5. 10 15 



GGC GTA TGC ACG GCG CTG ATG CTT CAG CGC CGC GGA TTC AAA GTC ACC 96 
Gly Val Cys Thr Ala Leu Met Leu Gin Arg Arg Gly Phe Lys Val Thr 

20 25 30 



TTG ATT GAC CCG AAC CCT CCT GGC GAA GGT GCA TCG TTT GGG AAT GCC 144 
Leu He Asp Pro Asn Pro Pro Gly Glu Gly Ala Ser Phe Gly Asn Ala 

35 40 45 



GGA TGC TTC AAC GGC TCA TCC GTC GTC CCT ATG TCC ATG CCG GGA AAC 192 
Gly Cys Phe Asn Gly Ser Ser Val Val Pro Met Ser Met Pro Gly Asn 

50 55 60 



TTG ACG AGC GTG CCG AAG TGG CTC CTT GAC CCG ATG GGC CGT TGT CAA 240 
Leu Thr Ser Val Pro Lys Trp Leu Leu Asp Pro Met Gly Arg Cys Gin 

65 70 75 80 



TCC GGT TCA GCT ATT TCC AAC CAT CAT GCC TGG TTG ATT CGC TTT CTG 288 
Ser Gly Ser Ala He Ser Asn His His Ala Trp Leu He Arg Phe Leu 

85 90 95 



TTA GCC GGA AGA CCA AAC AAG GTG AAG GAG CAG GCG AAA GCA CTC CGC 336 
Leu Ala Gly Arg Pro Asn Lys Val Lys Glu Gin Ala Lys Ala Leu Arg 
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100 105 HO 

AAT CTC ATC AAG TCC ACG GTG CCT CTG ATC AAG TCA TTG GCG GAG GAG 384 
Asn Leu lie Lys Ser Thr Val Pro Leu lie Lye Ser Leu Ala Glu Glu 

115 120 125 

GCT GAT GCG AGC CAT CTG ATC CGC CAT GAA GGT CAT CTG ACC GTA TAT 432 
Ala Asp Ala Ser His Leu He Arg His Glu Gly His Leu Thr Val Tyr 

130 135 140 

CGT GGA GAA GCA GAC TTC GCC AAG GAC CGC GGA GGT TGG GAA CTG CGG 480 
Arg Gly Glu Ala Asp Phe Ala Lys Asp Arg Gly Gly Trp Glu Leu Arg 

145 150 155 160 

CGT CTC AAC GGT GTT CGC ACG CAG ATC CTC AGC GCC GAT GCG TTG CGG 528 
Arg Leu Asn Gly Val Arg Thr Gin He Leu Ser Ala Asp Ala Leu Arg 

165 170 175 



GAT TTC GAT CCG AAC TTG TCG CAT GCG TTT ACC AAG GGC ATT CTT ATA 576 
Asp Phe Asp Pro Asn Leu Ser His Ala Phe Thr Lys Gly He Leu He 

180 185 190 



GAA GAG AAC GGT CAC ACG ATT AAT CCG CAA GGG CTC GTG ACC CTC TTG 624 
Glu Glu Asn Gly His Thr He Asn Pro Gin Gly Leu Val Thr Leu Leu 

195 200 205 



TTT CGG CGT TTT ATC GCG AAC GGT GGC GAA TTC GTA TCT GCG CGT GTC 672 
Phe Arg Arg Phe He Ala Asn Gly Gly Glu Phe Val Ser Ala Arg Val 

210 215 220 

ATC GGC TTT GAG ACT GAA GGT AGG GCG CTT AAA GGC ATT ACA ACC ACG 720 
He Gly Phe Glu Thr Glu Gly Arg Ala Leu Lys Gly He Thr Thr Thr 

225 230 235 240 

AAC GGC GTT CTG GCC GTT GAT GCA GCG GTT GTC GCA GCC GGC GCA CAC 768 
Asn Gly Val Leu Ala Val Asp Ala Ala Val Val Ala Ala Gly Ala His 
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245 250 255 

TCG AAA TCA CTT GCT AAT TCG CTA GGC GAT GAC ATC CCG CTC GAT ACC 816 
Ser Lye Ser Leu Ala Asn Ser Leu Gly Asp Asp lie Pro Leu Asp Thr 

260 265 270 

GAA CGT GGA TAT CAT ATC GTC ATC GCG AAT CCG GAA GCC GCT CCA CGC 864 
Glu Arg Gly Tyr Hia He Val He Ala Asn Pro Glu Ala Ala Pro Arg 

275 280 285 

ATT CCG ACG ACC GAT GCG TCA GGA AAA TTC ATC GCG ACA CCT ATG GAA 912 
He Pro Thr Thr Asp Ala Ser Gly Lys Phe lie Ala Thr Pro Met Glu 

290 295 300 

ATG GGG CTT CGC GTG GCG GGT ACG GTT GAG TTC GCT GGG CTC ACA GCC 960 
Met Gly Leu Arg Val Ala Gly Thr Val Glu Phe Ala Gly Leu Thr Ala 

305 310 315 320 

GCT CCT AAC TGG AAA CGT GCG CAT GTG CTC TAT ACG CAC GCT CGA AAA 1008 
Ala Pro Asn Trp Lye Arg Ala His Val Leu Tyr Thr His Ala Arg Lys 

325 330 335 

CTT CTT CCA GCC CTC GCG CCT GCG AGT TCT GAA GAA CGA TAT TCC AAA 1056 
Leu Leu Pro Ala Leu Ala Pro Ala Ser Ser Glu Glu Arg Tyr Ser Lys 

340 345 350 

TGG ATG GGG TTC CGG CCG AGC ATC CCG GAT TCG CTC CCC GTG ATT GGC 1104 
Trp Met Gly Phe Arg Pro Ser He Pro Asp Ser Leu Pro Val He Gly 

355 360 365 

CGG GCA ACC CGG ACA CCC GAC GTA ATC TAT GCT TTC GGC CAT GGT CAT 1152 
Arg Ala Thr Arg Thr Pro Asp Val lie Tyr Ala Phe Gly His Gly His 

370 375 380 

CTC GGC ATG ACA GGG GOG CCG ATG ACC GCA ACG CTC GTC TCA GAG CTC 1200 
Leu Gly Met Thr Gly Ala Pro Met Thr Ala Thr Leu Val Ser Glu Leu 
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385 390 395 400 

CTC GCA GGC GAA AAG ACC TCA ATC CAC ATT TCG CCC TTC CCA CCA AAC 1248 
Leu Ala Gly Glu Lys Thr Ser lie Asp lie Ser Fro Phe Ala Pro Asn 

405 410 415 

CGC TTT GGT ATT GGC AAA TCC AAG CAA ACG GGT CCG GCA ACT TAA 1293 
Arg Phe Gly lie Gly Ly8 Ser Lys Gin Thr Gly Pro Ala Ser 
420 425 430 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 430 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Ser Glu Asn His Lys Lys Val Gly lie Ala Gly Ala Gly He Val 

15 10 15 

Gly Val Cya Thr Ala Leu Met Leu Gin Arg Arg Gly Phe Lys Val Thr 
20 25 30 

Leu He Asp Pro Asn Pro Pro Gly Glu Gly Ala Ser Phe Gly Asn Ala 
35 40 45 

Gly Cys Phe Asn Gly Ser Ser Val Val Pro Met Ser Met Pro Gly Asn 
50 55 60 

Leu Thr Ser Val Pro Lys Trp Leu Leu Asp Pro Met Gly Arg Cys Gin 
65 70 75 80 

Ser Gly Ser Ala He Ser Asn His His Ala Trp Leu He Arg Phe Leu 
85 90 95 
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Leu Ala Gly Arg Pro Aon Lye Val Lye Glu Gin Ala Lye Ala Leu Arg 
100 105 110 

Asn Leu He Lys Ser Thr Val Pro Leu He Lye Ser Leu Ala Glu Glu 
115 120 125 

Ala Asp Ala Ser His Leu He Arg Hie Glu Gly His Leu Thr Val Tyr 
130 135 140 

Arg Gly Glu Ala Asp Phe Ala Lys Asp Arg Gly Gly Trp Glu Leu Arg 
145 150 155 160 

Arg Leu Asn Gly Val Arg Thr Gin He Leu Ser Ala Asp Ala Leu Arg 
165 170 175 

Asp Phe Asp Pro Asn Leu Ser His Ala Phe Thr Lys Gly He Leu He 
180 185 190 

Glu Glu Asn Gly His Thr He Asn Pro Gin Gly Leu Val Thr Leu Leu 
195 200 205 

Phe Arg Arg Phe He Ala Asn Gly Gly Glu Phe Val Ser Ala Arg Val 
210 215 220 

He Gly Phe Glu Thr Glu Gly Arg Ala Leu Lys Gly He Thr Thr Thr 
225 230 235 240 

Asn Gly Val Leu Ala Val Asp Ala Ala Val Val Ala Ala Gly Ala His 
245 250 255 



Ser Lys Ser Leu Ala Asn Ser Leu Gly Asp Asp He Pro Leu Asp Thr 
260 265 270 
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Glu Arg Cly Tyr His He Val He Ala Aan Pro Glu Ala Ala Pro Arg 
275 280 285 

lie Pro Thr Thr Asp Ala Ser Gly Lys Phe He Ala Thr Pro Met Glu 
290 295 300 

Met Gly Leu Arg Val Ala Gly Thr Val Glu Phe Ala Gly Leu Thr Ala 
305 310 315 320 

Ala Pro Asn Trp Lys Arg Ala His Val Leu Tyr Thr His Ala Arg Lys 
^25 330 335 

Leu Leu Pro Ala Leu Ala Pro Ala Ser Ser Glu Glu Arg Tyr Ser Lys 
340 345 350 

Trp Met Gly Phe Arg Pro Ser He Pro Asp Ser Leu Pro Val He Gly 
355 360 365 

Arg Ala Thr Arg Thr Pro Asp Val He Tyr Ala Phe Gly Bis Gly His 
370 375 380 

Leu Gly Met Thr Gly Ala Pro Met Thr Ala Thr Leu Val Ser Glu Leu 
385 390 395 400 

Leu Ala Gly Glu Lys Thr Ser He Asp He Ser Pro Phe Ala Pro Asn 
405 410 415 



Arg Phe Gly lie Gly Lys Ser Lys Gin Thr Gly Pro Ala Ser 
420 425 * 430 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 



WO 92/00377 



PCT/US91/04514 



- 101 - 

(A) LENGTH: 1296 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: ONA (recombinant) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

ATGGCTGAGA ACCACAAAAA AGTAGGCATC GCTGGAGCCG GAATCGTCGG CGTATGCACG 60 

GCGCTGATGC TTCAGCGCCG CGGATTCAAA GTCACCTTGA TTGACCCGAA CCCTCCTGGC 120 

GAAGGTGCAT CGTTTGGGAA TGCCGGATGC TTCAACGGCT CATCCGTCGT CCCTATGTCC 180 

ATGCCGGGAA ACTTGACGAG CGTGCCGAAG TGGCTCCTTG ACCCGATGGG GCCGTTGTCA 240 

ATCCGGTTCA GCTATTTTCC AACCATCATG CCCTGGTTGA TTCGCTTTCT GTTAGCCGGA 300 

AGACCAAACA AGGTGAAGGA GCAGGCGAAA GCACTCCGCA ATCTCATCAA GTCCACGGTG 360 

CCTCTGATCA AGTCATTGGC GGAGGAGGCT GATGCGAGCC ATCTGATCCG CCATGAAGGT 420 

CATCTGACCG TATATCGTGG AGAAGCAGAC TTCGCCAAGG ACCGCGGAGG TTGGGAACTG 480 

CGGCGTCTCA ACGGTGTTCG CACGCAGATC CTCAGCGCCG ATGCGTTGCG GGATTTCGAT 540 

CCGAACTTGT CGCATGCGTT TACCAAGGGC ATTCTTATAG AAGAGAACGG TCACACGATT 600 

AATCCGCAAG GGCTCGTGAC CCTCTTGTTT CGGCGTTTTA TCGCGAACGG TGGCGAATTT 660 

GTATCTGCGC GTCTCATCGG CTTTGAGACT CAAGGTAGGG CGCTTAAAGG CATTACAACC 720 

ACGAACGGCG TTCTGCCCGT TGATGCAGCG GTTGTCGCAG CCGGCGCACA CTCGAAATCA 780 

CTTGCTAATT CGCTAGGCGA TGACATCCCG CTCGATACCG AACGTGGATA TCATATCGTC 840 

ATCGCGAATC CGGAAGCCGC TCCACGCATT CCGACGACCG ATGCGTCAGG AAAATTCATC 900 

GCGACACCTA TGGAAATGGG GCTTCGCGTG GCGGGTACGG TTGAGTTCGC TGGGCTCACA 960 

GCCGCTCCTA ACTGGAAACG TGCGCATGTG CTCTATACGC ACGCTCGAAA ACTTCTTCCA 1020 

GCCCTCGCGC CTGCGAGTTC TGAAGAACGA TATTCCAAAT GGATCGGGTT CCGGCCGAGC 1080 

ATCCCGGATT CGCTCCCCGT GATTGGCCGG GCAACCCGGA CACCCGACGT AATCTATGCT 1140 

TTCGGCCACG GTCATCTCGG CATGACAGGG GCGCCGATGA CCGCAACGCT CGTCTCAGAG 1200 

CTCCTCGCAG GCGAAAAGAC CTCAATCGAC ATTTCGCCCT TCGCACCAAA CCGCTTTGGT 1260 

ATTGGCAAAT CCAAGCAAAC GGGTCCGGCA AGTTAA 1296 
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<2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1296 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (recombinant) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



ATGGCTGAGA 


ACCACAAAAA AGTAGGCATC 


GCTGGAGCTG GAATCGTTGG TCT&Trp&rn> 


60 


GCTTTGATGC 


TTCAACGTCG 


TGGATTCAAA 


GTCACCTTGA TTGACCCGAA cenrrrvnnn 


120 


GAAGGTGCAT 


CGTTTGGGAA 


TGCCGGATGC 


TTCAACGGCT CATCCGTCGT CCCTATGTrr 




ATGCCGGGAA 


ACTTGACGAG 


CGTGCCGAAG 


TGGCTCCTTG ACCCGATGGG GCCGTTGTPA 




ATCCGGTTCA 


GCTATTTTCC AACCATCATG 


CCCTGGTTGA TTCGCTTTCT GTTAGCCGCA 




AGACCAAACA 


AGGTGAAGGA GCAGGCGAAA 


GCACTCCGCA ATCTCATCAA GTCCACGGTG 


JoU 


CCTCTGATCA 


AGTCATTGGC GGAGGAGGCT 


GATGCGAGCC ATCTGATCCG CCATGAAGGT 




CATCTGACCG 


TATATCGTGG 


AGAAGCAGAC 


TTCGCCAAGG ACCGCGGAGG TTGGGAACTG 


480 


CGGCGTCTCA 


ACGGTGTTCG 


CACGCAGATC 


CTCTCTGCTG ATGCTTTGCC Tr.&TT^prBT 


540 


CCTAACTTGT 


CGCATGCTTT < 


TACCAAGGGC 


ATTCTTATAG AAGAGAACGG TCACACGATT 


600 


AATCCGCAAG 


GGCTCGTGAC 


CCTCTTGTTT 


CGGCGTTTTA TCGCGAACGG TGGCGAATTT 


660 


GTATCTGCGC 


GTGTCATCGG 


TTTTGAGACT 


GAAGGTCGTG CTCTCAAACG CATTACAACC 


720 


ACTAACGGTG 


TTCTGGCTGT 


TGATGCAGCT 


GTTGTTGCAG CTGGTGCACA CTCTAAATCA 


780 


CTTGCTAATT 


CGCTAGGCGA 


TGACATCCCG 


CTCGATACCG AACGTGGATA TCATATCGTC 


840 


ATCGCGAATC 


CGGAAGCCGC 


TCCACGCATT 


CCGACGACCG ATGCGTCAGG AAAATTCATC 


900 


GCGACACCTA 


TGGAAATGGG 


TCTTCGTGTT 


GCTGGTACTG TTGA6TTTGC TGGTCTCACA 


960 


GCTGCTCCTA 


ACTGGAAACG 


TGCGCATGTG 


CTCTATACGC ACGCTCGAAA ACTTCTTCCA 


1020 


GCCCTCGCGC 


CTGCGAGTTC 


TGAAGAACGA 


TATTCCAAAT GGATGGGTTT TCGTCCTAGC 


1080 


ATTCCTGATT 


CTCTTCCAGT 


GATTGGTCGT 


GCAACTCGTA CACCCCACGT AATCTATGCT 


1140 


TTTGGTCACG 


GTCATCTCGG 


TATGACAGGT 


GCTCCAATGA CTGCAACTCT CGTCTCAGAG 


1200 
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CTCCTCGCAG GCGAAAAGAC CTCAATCGAC ATTTCGCCCT TCGCACGAAA CCGCTTTGGT 1260 

ATTGGCAAAT CCAAGCAAAC GGGTCCGGCA AGTTAA 1296 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1296 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

ATGGCTGAGA ACCACAAGAA GGTfGGTATC GCTGGAGCTG GAATCGTTGG TGTTTGCACT 60 

GCTTTGATGC TTCAACGTCG TGGATTCAAG GTTACCTTGA TTGATCCAAA CCCACCAGGT 120 

GAAGGTGCTT CTTTCGGTAA CGCTGGTTGC TTCAACGGTT CCTCCGTTGT TCCAATGTCC 180 

ATGCCAGGAA ACTTGACTAG CGTTCCAAAG TGGCTTCTTG ACCCAATGGG TCCATTGTCC 240 

ATCCGTTTCA GCTACTTTCC AACCATCATG CCTTGGTTGA TTCGTTTCTT GCTTGCTGGA 300 

AGACCAAACA AGGTGAAGGA GCAAGCTAAG GCACTCCGTA ACCTCATCAA GTCCACTGTG 360 

CCTTTGATCA AGTCCTTGGC TGAGGAGGCT GATGCTAGCC ACCTTATCCG TCACGAAGGT 420 

CACCTTACCG TGTACCGTGG AGAAGCAGAC TTCGCCAAGG ACCGTGGAGG TTGGGAACTT 480 

CGTCGTCTCA ACGGTGTTCG TACTCAAATC CTCAGCGCTG ATGCATTGCG TGATTTCGAT 540 

CCTAACTTGT CTCACGCCTT TACCAAGGGA ATCCTTATCG AAGAGAACGG TCACACCATC 600 

AACCCACAAG GTCTCGTGAC TCTCTTGTTT CGTCGTTTCA TCGCTAACGG TGGAGAGTTC 660 

GTGTCTGCTC GTGTTATCGG ATTCGAGACT GAAGGTCGTG CTCTCAAGGG TATCACCACC 720 

ACCAACGGTG TTCTTGCTGT TGATGCAGCT GTTGTTGCAG CTGGTGCACA CTCCAAGTCT 780 

CTTGCTAACT CCCTTGGTGA TGACATCCCA TTGGATACCG AACGTGGATA CCACATCGTG 840 

ATCGCCAACC CAGAAGCTGC TCCACGTATT CCAACTACCG ATGCTTCTGG AAAGTTCATC 900 

GCTACTCCTA TGGAGATGGG TCTTCGTGTT GCTGGAACCG TTGAGTTCGC TGGTCTCACT 960 

GCTGCTCCTA ACTGGAAGCG TGCTCACGTT CTCTACACTC ACGCTCGTAA GTTGCTTCCA 1020 

GCTCTCGCTC CTGCCAGTTC TGAAGAACGT TACTCCAAGT GGATGGGTTT CCGTCCAAGC 1080 
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ATCCCAGATT CCCTTCCAGT GATTGGTCGT GCTACCCGTA CTCCAGACGT TATCTACGCT 
TTCGGTCACG GTCACCTCGG TATGACTGGT GCTCCAATGA CCGCAACCCT CGTTTCTGAG 
CTCCTCGCAG GTGAGAAGAC CTCTATCGAC ATCTCTCCAT TCGCACCAAA CCGTTTCGGT 
ATTGGTAAGT CCAAGCAAAC TGGTCCTGCA TCCTAA 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 279 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (recombinant) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

AGATCTCCAC AATGGCTTCC TCTATGCTCT CTTCCGCTAC TATGGTTGCC TCTCCGGCTC 

AGGCCACTAT GGTCGCTCCT TTCAACGGAC TTAAGTCCTC CGCTGCCTTC CCAGCCACCC 

GCAAGGCTAA CAACGACATT ACTTCCATCA CAAGCAACGG CGGAAGAGTT AACTGCATGC 

AGGTGTGGCC TCCGATTGGA AAGAAGAAGT TTGAGACTCT CTCTTACCTT CCTGACCTTA 

CCGATTCCGG TGGTCGCGTC AACTGCATGC AGGCCATGG 

(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 318 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (recombinant) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
AGATCTATCG ATAAGCTTGA TGTAATTGGA GGAAGATCAA AATTTTCAAT CCCCATTCTT 
CGATTGCTTC AATTGAAGTT TCTCCGATGG CGCAAGTTAG CAGAATCTGC AATGGTGTGC 
AGAACCCATC TCTTATCTCC AATCTCTCGA AATCCAGTCA ACGCAAATCT CCCTTATCGG 
TTTCTCTGAA GACGCAGCAG CATCCACGAG CTTATCCGAT TTCGTCGTCG TGGGGATTGA 
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AGAAGAGTGG CATCACGTTA ATTCCCTCTG AGCTTCGTCC TCTTAAGCTC ATGTCTTCTG 300 

TTTCCACGGC GTGCATGC 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEONESS: double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



NCATGGACGT CTGATCGAAA TCGTCGTTAC CGCAGCAAGG TAAGGCACGC CGAATTTTAT 
CACCTACCGC GAAACGGTGG CTAGGCAGCG AGAGACTGTC GGCTCCGCGG GAGCATCCT 
(2) INFORMATION FOR SEQ ID NO: 12: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 277 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 i 



GTACTTACGC GGTCGTGAGT ACAGCGCAGA GCCGGTGTCA AGATCAATCT GCACCTCGCA 
ATCACCTCGG AGACGCGAAA TGGCGCAAAT AGAACACATA TTAACGAGTC ACGCCCCGAA 
GCCTTTGGGT CACTACAGTC AGGCGGCCCG AGCGGGTGGA TTCATTCATG TTTCCGGTCA 
GCTTCCGATC AAACCAGAAG GCCAGTCGGA GCAATCTGAC GATCTCGTCG ATAACCAGGC 
CAGTCTCGTT CTCCGGAATT TGCTGGCCGT ACTCGAG 
(2) INFORMATION FOR SEQ ID NO: 13: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (synthetic) 



PCT/US91/04S14 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GAGAGACTGT CGACTCCGCG GGAGCATCAT ATG 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 
GAAOGAATCC AAGCTTCTCA CGACCGCGTA AGTAC 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GCCGAGATGA CCGTGGCCGA AAGC 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



♦ 
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GGG AATGCCG GATGCTTCAA CGGC 24 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1296 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA (recombinant) 



(ix) FEATURE: 

(A) NAME/KEY: COS 

(B) LOCATION: 1..1296 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

ATG GCT GAG AAC CAC AAG AAG GTT GGT ATC GCT GGA GCT GGA ATC GTT 48 
Met Ala Glu Asn His Lys Lys Val Gly He Ala Gly Ala Gly He Val 

15 10 15 



GGT GTT TGC ACT GCT TTG ATG CTT CAA CGT CGT GGA TTC AAG GTT ACC 96 
Gly Val Cya Thr Ala Leu Met Leu Gin Arg Arg Gly Phe Lys Val Thr 

20 25 30 



TTG ATT GAT CCA AAC CCA CCA GGT GAA GGT GCC TCT TTC GGT AAC GCT 144 
Leu He Asp Pro A8n Pro Pro Gly Glu Gly Ala Ser Phe Gly Asn Ala 

35 40 45 



GGT TGC TTC AAC GGT TCC TCC GTT GTT CCA ATG TCC ATG CCA GGA AAC 192 
Gly Cys Phe Asn Gly Ser Ser Val Val Pro Met Ser Met Pro Gly Asn 

50 55 60 



TTG ACT AGC GTT CCA AAG TGG CTT CTT GAC CCA ATG GGT CCA TTG TCC 240 
Leu Thr Ser Val Pro Lys Trp Leu Leu Asp Pro Met Gly Pro Leu Ser 

65 70 75 ' 80 



ATC CGT TTC GGC TAC TTT CCA ACC ATC ATG CCT TGG TTG ATT CGT TTC 288 
He Arg Phe Gly Tyr Phe Pro Thr He Met Pro Trp Leu He Arg Phe 



85 



90 



95 
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TTG CTT GCT GGA AGA CCA AAC AAG GTG AAG GAG CAA GCT AAG GCA CTC 336 
Leu Leu Ala Gly Arg Pro Asn Lys Val Lya Glu Gin Ala Lye Ala Leu 

100 105 110 

CGT AAC CTC ATC AAG TCC ACT GTG CCT TTG ATC AAG TCC TTG GCT GAG 384 
Arg Asn Leu He Lys Ser Thr Val Pro Leu He Lys Ser Leu Ala Glu 

115 120 125 



GAG GCT GAT GCT AGC CAC CTT ATC CGT CAC GAA GGT CAC CTT ACC GTG 432 
Glu Ala Asp Ala Ser His Leu He Arg His Glu Gly His Leu Thr Val 

130 135 140 

TAC CGT GGA GAA GCA GAC TTC GCC AGG GAC CGT GGA GGT TGG GAA CTT 480 
Tyr Arg Gly Glu Ala Asp Phe Ala Arg Asp Arg Gly Gly Trp Glu Leu 

145 150 155 160 

CGT CGT CTC AAC GGT GTT CGT ACT CAA ATC CTC AGC GCT GAT GCA TTG 528 
Arg Arg Leu Asn Gly Val Arg Thr Gin He Leu Ser Ala Asp Ala Leu 

165 170 175 

CGT GAT TTC GAT CCT AAC TTG TCT CAC GCC TTT ACC AAG GGA ATC CTT 576 
Arg Asp Phe Asp Pro Asn Leu Ser His Ala Phe Thr Lys Gly He Leu 

180 185 190 

ATC GAA GAG AAC GGT CAC ACC ATC AAC CCA CAA GGT CTC GTG ACT CTC 624 
He Glu Glu Asn Gly His Thr He Asn Pro Gin Gly Leu Val Thr Leu 

195 200 205 



TTG TTT CGT CGT TTC ATC GCT AAC GGT GGA GAG TTC GTG TCT GCT CGT 672 
Leu Phe Arg Arg Phe He Ala Asn Gly Gly Glu Phe Val Ser Ala Arg 

210 215 2i0 



GTT ATC GGA TTC GAG ACT GAA GGT CGT GCT CTC AAG GGT ATC ACC ACC 720 
Val He Gly Phe Glu Thr Glu Gly Arg Ala Leu Lys Gly He Thr Thr 



225 



230 



235 



240 
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ACC AAC GGT GTT CTT GCT GTT GAT GCA GCT GTT GTT GCA GCT GGT GCA 768 
Thr Asn Gly Val Leu Ala Val Asp Ala Ala Val Val Ala Ala Gly Ala 

245 250 255 



CAC TCC AAG TCT CTT GCT AAC TCC CTT GGT GAT GAC ATC CCA TTG GAT 816 
His Ser Lys Ser Leu Ala Asn Ser Leu Gly Asp Asp lie Pro Leu Asp 

260 265 270 

ACC GAA CGT GGA TAC CAC ATC GTG ATC GCC AAC CCA GAA GCT GCT CCA 864 
Thr Glu Arg Gly Tyr His He Val He Ala Asn Pro Glu Ala Ala Pro 

275 280 285 

CGT ATT CCA ACT ACC GAT GCT TCT GGA AAG TTC ATC GCT ACT CCT ATG 912 
Arg He Pro Thr Thr Asp Ala Ser Gly Lys Phe He Ala Thr Pro Met 

290 295 300 

GAG ATG GGT CTT CGT GTT GCT GGA ACC GTT GAG TTC GCT GGT CTC ACT 960 
Glu Met Gly Leu Arg Val Ala Gly Thr Val Glu Phe Ala Gly Leu Thr 

305 310 315 320 

GCT GCT CCT AAC TGG AAG CGT GCT CAC GTT CTC TAC ACT CGC GCT CGT 1008 
Ala Ala Pro Asn Trp Lys Arg Ala His Val Leu Tyr Thr Arg Ala Arg 

325 330 335 

AAG TTG CTT CCA GCT CTC GCT CCT GCC AGT TCT GAA GAA CGT TAC TCC 1056 
Lys Leu Leu Pro Ala Leu Ala Pro Ala Ser Ser Glu Glu Arg Tyr Ser 

340 345 350 

AAG TGG ATG GGT TTC CGT CCA AGC ATC CCG GAT TCC CTT CCA GTG ATT 1104 
Lys Trp Met Gly Phe Arg Pro Ser He Pro Asp Ser Leu Pro Val He 

355 360 365 

GGT CGT GCT ACC CGT ACT CCA GAC GTT ATC TAC GCT TTC GGT CAC GGT 1152 
Gly Arg Ala Thr Arg Thr Pro Asp Val He Tyr Ala Phe Gly His Gly 

370 375 380 
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CAC CTC GGT ATG ACT GGT GCT CCA ATG ACC GCA ACC CTC GTT TCT GAG 1200 
His Leu Gly Ket Thr Gly Ala Pro Met Thr Ala Thr Leu Val Ser Glu 

385 390 395 400 

CTC CTC GCA GGT GAG AAG ACC TCT ATC GAC ATC TCT CCA TTC GCA CCA 1248 
Leu Leu Ala Gly Glu Lys Thr Ser lie Asp lie Ser Pro Phe Ala Pro 

405 410 415 

AAC CGT TTC GGT ATT GGT AAG TCC AAG CAA ACT GGT CCT GCA TCC TAA 1296 
Asn Arg Phe Gly lie Gly Lys Ser Lys Gin Thr Gly Pro Ala Ser 
420 425 430 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 431 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Ala Glu Asn His Lys Lys Val Gly He Ala Gly Ala Gly He Val 

1 5 10 15 

Gly Val Cys Thr Ala Leu Met Leu Gin Arg Arg Gly Phe Lys Val Thr 
20 25 30 

Leu He Asp Pro Asn Pro Pro Gly Glu Gly Ala Ser Phe Gly Asn Ala 
35 40 45 

Gly Cys Phe Asn Gly Ser Ser Val Val Pro Met Ser Met Pro Gly Asn 
50 55 60 

Leu Thr Ser Val Pro Lys Trp Leu Leu Asp Pro Met Gly Pro Leu Ser 
65 70 75 *0 
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He Arg Phe Gly Tyr Phe Pro Thr He Met Pro Trp Leu He Arg Phe 
85 90 95 

Leu Leu Ala Gly Arg Pro Asn Lye Val Lys Glu Gin Ala Lys Ala Leu 
100 105 110 

Arg Asn Leu He Lys Ser Thr Val Pro Leu He Lys Ser Leu Ala Glu 
115 120 125 

Glu Ala Asp Ala Ser His Leu He Arg His Glu Gly His Leu Thr Val 
130 135 140 

Tyr Arg Gly Glu Ala Asp Phe Ala Arg Asp Arg Gly Gly Trp Glu Leu 
145 150 155 160 

Arg Arg Leu Asn Gly Val Arg Thr Gin lie Leu Ser Ala Asp Ala Leu 
165 170 175 

Arg Asp Phe Asp Pro Asn Leu Ser His Ala Phe Thr Lys Gly He Leu 
180 185 190 

lie Glu Glu Asn Gly His Thr lie Asn Pro Gin Gly Leu Val Thr Leu 
195 200 205 

Leu Phe Arg Arg Phe lie Ala Asn Gly Gly Glu Phe Val Ser Ala Arg 
210 215 220 

Val He Gly Phe Glu Thr Glu Gly Arg Ala Leu Lys Gly lie Thr Thr 
225 230 235 240 



Thr Asn Gly Val Leu Ala Val Asp Ala Ala Val Val Ala Ala Gly Ala 
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245 250 255 



His Ser Lys Ser Leu Ala Asn Ser Leu Gly Asp A8p He Pro Leu Asp 
260 265 270 

Thr Olu Arg Gly Tyr His He Val lie Ala Asn Pro Glu Ala Ala Pro 
275 280 285 

Arg He Pro Thr Thr Asp Ala Ser Gly Lys Phe He Ala Thr Pro Met 
290 295 300 

Glu Met Gly Leu Arg Val Ala Gly Thr Val Glu Phe Ala Gly Leu Thr 
305 310 315 320 

Ala Ala Pro Asn Trp Lys Arg Ala His Val Leu Tyr Thr Arg Ala Arg 
325 330 335 

Lys Leu Leu Pro Ala Leu Ala Pro Ala Ser Ser Glu Glu Arg Tyr Ser 
340 345 350 

Lys trp Met Gly Phe Arg Pro Ser He Pro Asp Ser Leu Pro Val He 
355 1 360 365 



Gly Arg Ala Thr Arg Thr Pro Asp Val He Tyr Ala Phe Gly His Gly 
370 375 380 

His Leu Gly Met Thr Gly Ala Pro Met Thr Ala Thr Leu Val Ser Glu 
385 390 395 400 

Leu Leu Ala Gly Glu Lys Thr Ser He Asp He Ser Pro Phe Ala Pro 
405 410 415 



Asn Arg Phe Gly He Gly Lys Ser Lys Gin Thr Gly Pro Ala Ser 
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420 



425 



430 



(2) INFORMATION FOR SEQ ID NO: 19: 



(i) 



SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDED NESS : single 

(D) TOPOLOGY: linear 



(ii) 



MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

CGTTCTCTAC ACTCGTGCTC GTAAGTTGC 
29 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
CGTTCTCTAC ACTAAGGCTC GTAAGTTGC 29 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 
CGTTCTCTAC ACTCAAGCTC GTAAGTTGC 
(2) INFORMATION FOR SEQ ID NO: 22: 



29 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
CGTTCTCTAC ACTGCTGCTC GTAAGTTGC 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
CTCTACACTT GGGCTCGTAA GCTTCTTCCA GC 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
CTCTACACTA TCGCTCCTAA GCTTCTTCCA GC 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (synthetic) 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
CTCTACACTC TGGCTCGTAA GCTTCTTCCA GC 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: - 
CTCTACACTG AAGCTCGTAA GCTTCTTCCA GC 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
CGCTGGAGCT GGAATCGTTG GTGTATGCAC TGCTTTGATG CTTCAACGTC GTGGATTCAA 60 
AG 

62 

(2) INFORMATION FOR SEQ ID NO:28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 base pairs 

(B) TYPE* nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (synthetic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 
GCAGATCCTC TCTGCTGATG CTTTGCGTGA TTTCGATCCT AACTTGTCTC ATGCTTTTAC 60 

CAAGG cc 

65 

(2) INFORMATION FOR SEQ ID NO: 29: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
GTCATCGGTT TTGAGACTGA AGGTCGTGCT CTCAAAGGCA T 
(2) INFORMATION FOR SEQ ID NO: 30: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 



TACAACCACT AACGGTGTTC TGGCTGTTGA TGCAGCTGTT GTTGCAGCTG GTGCACACTC 60 
TAAATCACT 

69 

(2) INFORMATION FOR SEQ ID NO: 31: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
GGAAATGGGT CTTCGTGTTG CTGGTACTGT TGAGTTTGCT GGTCTCACAG CTGCTCCTAA 



60 
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(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: UNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
TGGATGCGTT TTCGTCCTAG CATTCCTGAT TCTCTTCCAG TGATTGGTCG TGCAACTCGT 
ACACCCGA 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
CGTAATCTAT GCTTTTGGTC ACGGTCATCT CGGTATGACA GGTGCTCCAA TGACTGCAAC 
TCTCGTCTC 



60 
69 
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Claims: 

» 

1. An isolated double-stranded DNA molecule 
consisting essentially of DNA encoding a glyphosate 
oxidoreductase enzyme. 



2. A recombinant, double-stranded DNA 
molecule comprising in sequence: 

a) a promoter which functions in plants to cause 
the production of an UNA sequence; 

b) a structural DNA sequence that causes the 
production of an RNA sequence which encodes 
a glyphosate oxidoreductase enzyme; and 

c) a 3' non-translated region which functions in 
plants to cause the addition of polyadenylated 
nucleotides to the 3' end of the RNA sequence 

where the promoter is heterologous with respect to 
the structural DNA sequence and adapted to cause 
sufficient expression of said enzyme in plant tissue, 
including meristematic tissue, to enhance the 
glyphosate tolerance of a plant cell transformed with 
said gene. 

3. A DNA molecule of Claim 2 in which said 
structural DNA sequence encodes a fusion polypeptide 
comprising an amino-terminal chloroplast transit peptide and 
a glyphosate oxidoreductase enzyme. 

4. A DNA molecule of Claim 3 in which the 
promoter is a plant DNA virus promoter. 
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5. A DNA molecule of Claim 4 in which the 
promoter is selected from the group consisting of CaMV35S and 
FMV35S promoters. 

6. A method of producing genetically 
transformed plants which are tolerant toward glyphosate 
herbicide, comprising the steps of: 

a) inserting into the genome of a plant cell a 
recombinant, double-stranded DNA molecule 
comprising: 

i) a promoter which functions in plant 
cells to cause the production of an RNA 
sequence, 

ii) a structural DNA sequence that causes 
the production of an RNA sequence 
which encodes a glyphosate oxido- 
reductase enzyme, 

iii) a 3' non-translated DNA sequence which 
functions in plant cells to cause the 
addition of polyadenylated nucleotides to 
the 3 V end of the RNA sequence 
where the promoter is heterologous with respect to 
the structural DNA sequence and adapted to cause 
sufficient expression of said enzyme in plant tissue, 
including meristematic tissue, to enhance the 
glyphosate tolerance of a plant cell transformed with 
said gene; 

b) obtaining a transformed plant cell; and 

c) regenerating from the transformed plant cell a 
genetically transformed plant which has 
increased tolerance to glyphosate herbicide. 
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7. A method of Claim 6 in which said structural 
DNA sequence encodes a fusion polypeptide comprising an 
amino terminal chloroplast transit peptide and a glyphosate 
oxidoreductase enzyme. 

8. A method of Claim 7 in which the promoter is 
from a plant DNA virus. 

9. A method of Claim 8 in which the promoter is 
selected from the group consisting of CaMV35S and FMV35S 
promoters. 

10. A glyphosate tolerant plant cell comprising a 
DNA molecule of Claim 3. 

11. A glyphosate tolerant plant cell of Claim 10 in 
which the promoter is a plant DNA virus promoter. 

12. A glyphosate tolerant plant cell of Claim 11 in 
which the promoter is selected from the group consisting of 
CaMV35S and FMV35S promoters. 

13. A glyphosate tolerant plant cell of Claim 10 
selected from the group consisting of corn, wheat, rice, 
soybean, cotton, sugarbeet, oilseed rape, canola, flax, 
sunflower, potato, tobacco, tomato, alfalfa, lettuce, apple, poplar 
and pine. 

14. A glyphosate tolerant plant comprising plant 
cells of Claim 10. 



15. A glyphosate tolerant plant of Claim 14 in 
which the promoter is from a DNA plant virus promoter. 
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16. A glyphosate tolerant plant of Claim 15 in 
which the promoter is selected from the group consisting of 
CaMV35S and FMV35S promoters. 

17. A glyphosate tolerant plant of Claim 14 
selected from the group consisting of corn, wheat, rice, 
soybean, cotton, sugarbeet, oilseed rape, canola, flax, 
sunflower, potato, tobacco, tomato, alfalfa, lettuce, apple, poplar 
and pine. 

18. A method for selectively controlling weeds in 
a field containing a crop having planted crop seeds or plants 
comprising the steps of: 

a) planting said crop seeds or plants which are 
glyphosate tolerant as a result of a 
recombinant double-stranded DNA molecule 
being inserted into said crop seed or plant, 
said DNA molecule having: 

i) a promoter which functions in plants to 
cause the production of an RNA 
sequence, 

ii) a structural DNA sequence that causes 
the production of an RNA sequence 
which encodes a glyphosate oxido- 
reductase enzyme, 

Hi) a 3* non-translated DNA sequence 
which functions in plants to cause the 
addition of polyadenylated nucleotides 
to the 3 r end of the RNA sequence 
where the promoter is heterologous with 
respect to the structural DNA sequence and 
adapted to cause sufficient expression of said 
enzyme in plant tissue, including 
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meristematic tissue to enhance the 
glyphosate tolerance of a plant transformed 
with said gene; and 

o 

b) applying to said crop and weeds in said field a 
sufficient amount of glyphosate herbicide to 
control said weeds without significantly 
affecting said crop. 

10 

19. A method of Claim 18 in which said 
structural DNA sequence encodes an amino terminal 
chloroplast transit peptide and a glyphosate oxidoreductase 
enzyme. 

15 

20. A method of Claim 19 in which the crop plant 
is selected from the group consisting of corn, wheat, rice, 
soybean, cotton, sugarbeet, oilseed rape, canola, flax, 
sunflower, potato, tobacco, tomato, lettuce, apple, poplar, pine 
and alfalfa. 

20 

21. A DNA of Claim 1 which hybridizes to the 
DNA sequence ofSEQ ID NO:3. 



22- A glyphosate oxidoreductase protein 
substantially free of other bacterial proteins comprising the 
amino acid sequence as set forth in SEQ ID NO:5. 

23. The glyphosate oxidoreductase protein of 
claim 22 wherein the amino acid residue at position 334 is 
arginine. 



24. The glyphosate oxidoreductase protein of 
claim 22 wherein the 'amino acid residue at position 334 is 
lysine. 
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25. The glyphosate oxidoreductase protein of 
claim 22 wherein the amino acid residue at position 334 is 
glutamine. 

26. The glyphosate oxidoreductase protein of 
claim 22 wherein the amino acid residue at position 334 is 
alanine. 

27. The glyphosate oxidoreductase protein of 
claim 22 wherein the amino acid residue at position 334 is 
tryptophan. 

28. The glyphosate oxidoreductase protein of 
claim 22 wherein the amino acid residue at position 334 is 
isoleucine. 



20 



25 



30 



29. The glyphosate oxidoreductase protein of 
claim 22 wherein the amino acid residue at position 334 is 
leucine. 

30. The glyphosate oxidoreductase protein of 
claim 22 wherein the amino acid residue at position 334 is 
glutamic acid. 

31. A method for selecting transformed plant 
tissue comprising: 

introducing a gene encoding glyphosate 
oxidoreductase into plant tissue; 

placing said plant tissue on a plant growth 
media containing glyphosate; 

selecting plant tissue which exhibits growth 
on said glyphosate containing media. 
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32. The method of claim 31 further comprising 
the step of confirming the presence of said glyphosate 
oxidoreductase gene in said plant tissue by recallusing on 
glyphosate a segment of said plant tissue exhibiting growth on 
glyphosate containing media. 

33. A recombinant bacterium containing the 
DNA of claim 1. 

34. A glyphosate oxidoreductase enzyme 
catalyzing the oxidation of glyphosate to aminomethyl- 
phosphonate and glyoxylate. 
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Sspl 

6358 TCATCAAAATATTTAGCAGCATTCCAGATTGGGTTCAA 

TCAACAAGGTAGGAGCCATATCACTTTATTCAAATTGG 

TATCGCCAAAACCAAGAAGGAACTCCCATCCTCAAAGG 

TTTGTAAGGAAGAATTCTCAGTCCAAAGCCTCAACAAG 

GTCAGGGTACAGAfTCTCCAAACCATTAGCCAAAAGCT 

ACAGGAGATCAATGAAGAATCTTCAATCAAAGTAAACT 

ACTGTTCCAGCACATGCATCATGGTCAGTAAGTTTCAG 

AAAAAGACATCCACCGAAGACTTAAAGTTAGTGGGCAT 

CTTTGAAAGTAATCTTGTCAACATCGAGCAGCTGGCTT 

GTGGGGACCAGACAAAAAAGGAATGGTGCAGAATTGTT 

AGGCGCACCTACCAAAAGCATCTTTGCCTTTATTGCAA 

AAGATAAAGCAGATTCCTCTAGTACAAGTGGGGAACAA 

AATAACGTGGAAAAGAGCTGTCCTGACAGCCCACTCAC 

TAATGCGTATGACGAACGCAGTGACGACCACAAAAGAA 

TTTTCCC7CTATATAAGAAGGCATTTCATJCCCATTTG 

AAGGATCATCAGATACTAACCAATATTTCTC 6954 

Sspl 

FIG.1 
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I NCATGGACGTCTGATCGAAATCGTCGTTACCGCAGCAAGGTAAGGCACGCCGAATTTTAT 
61 CACCTACCGCGAAACGGTGGCTAGGCAGCGAGAGACTGTCGGCTCCGCGGGAGCATCCTA 

H CMeil20') 

121 TGTCTGAGAACCACAAAAAAGTAGGCATCGCTGGAGCCGGAATCGTCGGCGTATGCACGG 

SENHKKVGIAGAGIVGVCTA 
181 CGCTGATGCTTCAGCGCCGCGGATTCAAAGTCACCTTGATTGACCCGAACCCTCCTGGCG 

LMLQRRGFKVTL I DPNPPGE 
241 AAGGTGCATCGTTTGGGAATGCCGGATGCTTCAACGGCTCATCCGTCGTCCCTATGTCCA 

GASFGNAGCFNGSSVVPMSH 
301 TGCCGGGAAACTTGACGAGCGTGCCGAAGTGGCTCCTTGACCCGATGGGGCCGTTGTCAA 

PGNLTSVPKVLLDPHGPLSI 
361 TCCGGTTCAGCTATTTTCCAACCATCATGCCCTGGTTGATTCGCTTTCTGTTAGCCGGAA 

RFSYFPTIMPWLIRFLLAGR 
421 GACCAAACAAGGTGAAGGAGCAGGCGAAAGCACTCCGCAATCTCATCAAGTCCACGGTGC 

PNKVKEQAKALRNLIKSTVP 
481 CTCTGATCAAGTCATTGGCGGAGGAGGCTGATGCGAGCCATCTGATCCGCCATGAAGGTC 

LIKSLAEEADASHLIRHEGH 
541 ATCTGACCGTATATCGTGGAGAAGCAGACTTCGCCAAGGACCGCGGAGGTTGGGAACTGC 

LTVYRGEADFAKDRGGVELR 
601 GGCGTCTCAACGGTGTTCGCACGCAGATCC7CAGCGCCGATGCGTTGCGGGATTTCGATC 

RLNGVRTQILSADALRDFDP 
SphI 

661 CGAACTTGTCGCATGCGTTTACCAAGGGCATTCTTATAGAAGAGAACGGTCACACGATTA 
NLSHAFTKGILIEENGHTIN 

EcoRI 

721 ATCCGCAAGGGCTCGTGACCCTCTTGTTTCGGCGTTTTATCGCGAACGGTGGCGAATTCG 
PQGLVTLLFRRFIANGGEFV 

781 TATCTGCGCGTGTCATCGGCTTTGAGACTGAAGGTAGGGCGCTTAAAGGCATTACAACCA 
SARVIGFETEGRALKG1TTT 

841 CGAACGGCGTTCTGGCCGTTGATGCAGCGGTTGTCGCAGCCGGCGCACACTCGAAATCAT 
NGVLAVDAAYVAAGAHSKSL 

EcoRV 



FIG.2A 
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901 TTGCTMTTCGCTAGGCGATGACATCCCGCTCGATACCGAACGTGGATATCATATCGTCA 
ANSLGDDIPLDTERGYH1 VI 
961 TCGCGAATCCGGAAGCCGCTCCACGCATTCCGACGACCGATGCGTCAGGAAAATTCATCG 
ANPEAAPRIPTTDASGKFIA 
1 021 CGACACCTATGGAAATGGGGCTTCGCGTGGCGGGTACGGTTGAGTTCGCTGGGCTCACAG 
TPHEHGLRVAGTVEFAGLTA 
1 OBI CCGCTCCTAACTGGAAACGTGCGCATGTGCTCTATACGCACGCTCGAAAACTTCTTCCAG 
APNUKRAHVLYTHARKLLPA 
1141 CCCTCGCGCCTGCGAGTTCTGAAGAACGATATTCCAAATGGATGGGGTTCCGGCCGAGCA 
LAPASSEERYSKVMGFRPS I 
1201 TCCCGGATTCGCTCCCCGTGATTGGCCGGGCAACCCGGACACCCGACGTAATCTATGCTT 
PDSLPVIGRATRTPDVIYAF 
Ncol Sad 
1261 TCGGCCATGGTCATCTCGGCATGACAGGGGCGCCGATGACCGCAACGCTCGTCTCAGAGC 
GHGHLGMTGAPMTATLVSEL 
1321 TCCTCGCAGGCGAAAAGACCTCAATCGACATTTCGCCCTTCGCACCAAACCGCTTTGGTA 
LAGEKTSIDISPFAPNRFGI 

Seal 

1381 TTGGCAAATCCAAGCAAACGGGTCCGGCAAGTTAAGTACTTACGCGGTCGTGAGTACAGC 

GKSKQTGPASki 
1441 GCAGAGCCGGTGTCAAGATCAATCTGCACCTCGCAATCACCTCGGAGACGCGAAATGGCG 
1501 CAAATAGAACACATATTAACGAGTCACGCCCCGAAGCCTTTGGGTCACTACAGTCAGGCG 
1561 GCCCGAGCGGGTGGATTCATTCATGTTTCCGGTCAGCTTCCGATCAAACCAGAAGGCCAG 
1621 TCGGAGCAATCTGACGATCTCGTCGATAACCAGGCCAGTCTCGTTCTCCGGAATTTGCTG 
Xhol 

1681 GCCGTACTCGAG 



FIG.2B 
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fMet 

I AGATCTCCATGGCTGAGAACCACAAAAAAGTA66CATCGCTGGAGCCGGA 50 
— T 

51 ATCGTCGGCGTATGCACGGCGCTGATGCTTCAGCGCCGCGGATTCAAAGT 100 
T T T TT ATT 

101 CACCTTGATTGACCCGAACCCTCCTGGCGAAGGTGCATCGTTTGGGAATG 150 



151 CCGGATGCTTCAACGGCTCATCCGTCGTCCCTATGTCCATGCCGGGAAAC 200 



201 TTGACGAGCGTGCCGAAGTGGCTCCTTGACCCGATGGGGCCGTTGTCAAT 250 



251 CCGGTTCAGCTATTTTCCAACCATCATGCCCTGGTTGATTCGCTTTCTGT 300 



301 TAGCCGGAAGACCAAACAAGGTGAAGGAGCAGGCGAAAGCACTCCGCAAT 350 



351 CTCATCAAGTCCACGGTGCCTCTGATCAAGTCATTGGCGGAGGAGGCTGA 400 



401 TGCGAGCCATCTGATCCGCCATGAAGGTCATCTGACCGTATATCGTGGAG 450 



451 AAGCAGACTTCGCCAAGGACCGCGGAGGTTGGCAACTGCGGCGTCTCAAC 500 



501 GGTGTTCGCACGCAGATCCTCAGCGCCGATGCGTTGCGGGATTTCGATCC 550 

TCT T T T 

551 GAACTTGTCGCATGCGTTTACCAAGGGCATTCTTATAGAAGAGAACGGTC 600 
T T 



FIG.3A 
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601 ACACGAnAATCCGCAAGGGCTCGTGACCCTCTTGTnCGGCGTTnATC 650 

651 6CGAACGGTGGCGAATTTGTATCTGCGC67GTCATCGGCTTTGAGACTGA 700 

T 

701 AGGTAGGGCGCTTAAAGGCATTACAACCACGAACGGCGTTCTGGCC6TTG 750 
CT T C T T T 

751 ATGCAGCGGTTGTCGCAGCCGGCGCACACTCGAAATCACTTGCTAATTCG 800 
T T IT T 

801 CTAGGCGATGACATCCCGCTCGATACCGAACGTGGATATCATATCGTCAT 850 



851 CGCGAATCCGGAAGCCGCTCCACGCATTCCGACGACCGATGCGTCAGGAA 900 

901 AAnCATCGMACACCTATGGAAATGGGGCTTCGCGTGGCGGGTACGGTT 950 

T T T T T 

951 GAGTTCGCTGGGCTCACAGCCGCTCCTAACTGGAAACGTGCGCATGTGCT 1000 
T T T 

1001 CTATACGCACGCTCGAAAACTTCTTCCAGCCCTCGCGCCTGCGAGTTCTG 1050 



1051 AAGAACGATAnCCAAATGGATGGGGTTCCGGCCGAGCATCCCGGATTCG 1100 

T T T T T T T 

1101 CTCCCCGTGATTGGCCGGGCAACCCGGACACCCGACGTAATCTATGCTTT 1150 
T A T T T T 

1151 CGGCCACGGTCATCTCGGCATGACAGGGGCGCCGATGACCGCAACGCTCG 1200 
TT T TTATT 

1201 TCTCAGAGCTCCTCGCAGGCGAAAAGACCTCAATCGACATTTCGCCCTTC 1250 



1251 GCACCAAACCGCTTTGGTATTGGCAAATCCAAGCAAACGGGTCCGGCAAG 1300 



1301 TTAAGTGGGAATTCAAGCTTG 1321 

stop RG.3B 

rnoCTITIITr QHEET 
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1 AGATCTCCATGGCTGAGAACCACAAAAAAGTAGGCATCGCTGGAGCCGGA 50 

G G T T T 

51 ATCGTt^CGTATGCACGGCGCTGATGCTTCAGCGCCGCGGATTCAAAGT 100 
T T T T TT ATT G 

101 CACCTTGATTGACCCGAACCCTCCTGGCGAAGGTGCATCGTTTGGGAATG 150 
T TA A A T TTCTC 

151 CCGGATGCTTCAACGGCTCATCCGTCGTCCCTATGTCCATGCCGGGAAAC 200 
TT TCTTA A 

201 TTGACGAGCGTGCCGAAGTGGCTCCTTGACCCGATGGGGCCGTTGTCAAT 250 
T T A T A T A C 

251 CCGGTTCAGCTATnTCCAACCATCATGCCCTGGTTGATTCGCTTTCTGT 300 
T C T T CT C 

301 TAGCCGGAAGACCAAACAAGGTGAAGGAGCAGGCGAAAGCACTCCGCAAT 350 
T T A T G T C 

351 CTCATCAAGTCCACGGTGCCTCTGATCAAGTCATTGGCGGAGGAGGCTGA 400 
T T C T 

401 TGCGAGCCATCTGATCCGCCATGAAGGTCATCTGACCGTATATCGTGGAG 450 
T C T T C C T G C 

451 AAGCAGACTTCGCCAAGGACCGCGGAGGTTGGGAACTGCGGCGTCTCAAC 500 

T T T 

501 GGTGTTCGCACGCAGATCCTCAGCGCCGATGCGTTGCGGGATTTCGATCC 550 
T T A TAT 

551 GAACTTGTCGCATGCGTTTACCAAGGGCATTCTTATAGAAGAGAACGGTC 600 
T T C C AC C 
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601 ACACGATTAATCCGCAA6GGCTCGTGACCCTCTTGTTTCGGCGTTTTATC 650 
C C C A T 7 T C 

651 GCGAACGGTGGCGAATTTGTATCTGCGCGTGTCATCGGCTTTGAGACTGA 700 
T AGCG T T AC 

701 AGGTAGGGCGCTTAAAGGCATTACAACCACGAACGGCGTTCTGGCCGTTG 750 
CTTCGTCC C T T T 

751 ATGCAGCGGTTGTCGCAGCCGGCGCACACTCGAAATCACTTGCTAATTC6 800 
T T TT CGT CC 

801 CTAGGCGATGACATCCCGCTCGATACCGAACGTGGATATCATA7CGTCAT 850 
T T AT G C C G 

851 CGCGAATCCGGAAGCCGCTCCACGCATTCCGACGACCGATGCGTCAGGAA 900 
C C A T TAT T T 

901 AAnCATCGCGACACCTATGGAAATGGGGCTTCGCGTGGCGGGTACGGTT 950 
G T T G T T T T A C 

951 GAGTTCGCTGGGCTCACAGCCGCTCCTAACTGGAAACGTGCGCATGTGCT 1000 
T T T G T C T 

1001 CTATACGCACGCTCGAAAACHCniXAGaXTCGCGCCTGCGAGnGTG 1050 
C T T GTG T T C 

1051 AAGAACGATATTCCAAATGGATGGGGTTCCGGCCGAGCATCCCGGATTCG 1100 
T C G T T A AC 

1101 CTCCCCGTGATTGGCCGGGCAACCCGGACACCCGACGTAATCTATGCTTT 1150 
T A T T T T T A T C 

1151 CGGCCACGGTCATCTCGGCATGACAGGGGCGCCGATGACCGCAACGCTCG 1200 
T CTTTTA' C 

1201 TCTCAGAGCTCCTCGCAGGCGAAAAGACCTCAATCGACATTTCGCCCTTC 1250 
T T T G T C T A 

1251 GCACCAAACCGCTTTGGTATTGGCAAATCCAAGCAAACGGGTCCGGCAAG 1300 
T C T G T T TC 

1301 TTAAGTGGGAATTCAAGCTTG 1321 
C 

FIG.4B 
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B 

0 

I 
I 
I 

AGATCTCCACAAT6GCTTCCTCTATGCTCTCTTCCGCTACTATGGTTGCCTCTCCGGCTC 
1 1 1 1 1 1 * 60 

TCTAGAGGTirrrACCGAAGGAGATACGAGAGAAGGCGATGATACCAACGGAGAGGCCGAG 

c» Me-WlftSerSerHeiLeuSerSerAlaThrMetValAlaSerProAlaGln - 

AGGCCACTATGGTCGCTCCTTTCAACGGACTTAAGTCCTCCGCTGCCTTCCCAGCCACCC 
61 1 1 1 1- y > 120 

TClOTGATAKAGCGAGGAAAtrnGCCTGAATTCAGGAGGCGACGGAAGGGTCGGTGGG 
ci A laThrHetVa IA laProPheAsnGtyLeuLysSerSerA laA laPheProA laThrArg - 

GCAAGGCTAACAAlMttnACrTCMT^ 
121 1 1 1 1 1 * 180 

CGnCCGATTGnGCTGTMTGAAGGTACTGTTCGTTGCCGCCTTCTCAATTGACGTACG 
ci LysAlaAsnAsnAspI leThrSerl leThrSerAsnGlyGlyArgVa lAsnCysHet&ln - 

AGGTGTGGCCTCCGATTGGAAAGAAGAAGTTTGAGACTCTCTCTTACCTTCCTGACC7TA 
181 1 1 1 1 < h 240 

TCCACACCGGAGGCTAACCTTTCTTCTTCAAACTCTGAGAGAGAATGGAAGGACTGGAAT 

ci ValTrpProProIleGlyLysLysLysPheGluThrLeuSerTyrLeuProAspLeuThr - 

N 
c 

0 

1 

CCGATTCCGGTGGTCGCGTCAACTGCATGCAGGCCATGG 
241 1 h- 1 279 

GGCTAAGGCCACCAGCGCAGTTGACGTACGTCCGGTACC ' 
ci AspSerGlyGlyArgValAsnCysMe-tGlnAlaHe-t 
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B 

g 
I 
I 
I 

AGATCTATCCATAAKTTGATGTAATTGGAGGAAGATCAAMTTnCAATCCCCATTCTT 
I 1 1 1 i 1 + 60 

TCTAGATAGCTAnCGMCTACATTAACCTCCTTCTAGTTTTAAAAGTTAGGGGTAAGAA 

CGATTKTTCAATTGAAGTTTCTCCGATGGCGCAAGTTAGCAGAATCTGCAATGGTGTGC 
61 F 1 1 1 — 1 + 120 

GCTMCGAAtTTTAACTTCAMGAGGCTACCGCGnCMTCGTCTTAGACGTTACCACACG 

MetAlaGlnValSerArgI leCysAsnGlyValGln - 

AGAAIXCATCTCTTATCTCCAATCTCTCGAAATCCAGTCAACGCAAATCTCCCTTATCGG 
121 1 1 1 h 1 + 180 

TCTTGGGTAGAGAATAGAGGTTAGAGAGCTTTAGGTCAGTTGCGTTTAGAGGGAATAGCC 

c« AsnProSerLeuIleSerAsnLeuSerLysSerSerGlnArgLysSerProLeuSerVal - 

TTTCTCTGAAGACGCAGCAGCATCCACGAGCTTATCCGATTTCGTCGTCGTGGGGATTGA 
181 1 1 ■ — i 1 1 h 240 

AAAGAGACTTCTGCGTCGTCGTAGGTGCTCGAATAGGCTAAAGCAGCAGCACCCCTAACT 

ci SerLeuLysThrGlnGlnHlsProAroAlQTyrProIleSerSerSerTrpGlyLeuLys - 

AGAAGAGTGGGATGACGTTAATTGGCTCTGAGCTTCGTCCTCTTAAGGTCATGTCTTCTG 
24i 1 1 1 1 1 1 

TCTTCTCACCCTACTGCAATTAACGGAGACTCGAAGCAGGAGAATTCCAGTAGAGAAGAC 
c» LysSerGlyHetThrLeulleGlySerGluLeuArgProLeuLysValMe-tSerSerVal - 

S 

P 
h 

I 

TTTCCACGGCGTGCATGC 

301 1 

AAAGGTGCCGCACGTACG 

c« SerThrAlaCysMet 
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